(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property 
Organization 
International Bureau 

(43) International Publication Date 
29 December 2004 (29.12.2004) 




PCT 



(10) International Publication Number 

WO 2004/113522 A1 



(51) International Patent Classification 7: C12N 9/56, 9/50, 9/64, 9/00, A61K 38/48 



(21) International Application Number: 

PCT/EP2004/051173 

(22) International Filing Date: 18 June 2004 (18.06.2004) 



(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 
03013819.2    18 June 2003 (18.06.2003)        EP 
03025851.1    10 November 2003 (10.11.2003)    EP 
03025871.9    11 November 2003 (11.11.2003)    EP 
04003058.7    11 February 2004 (11.02.2004)    EP 



(71) Applicant (for all designated States except US): DIREVO 
BIOTECH AG [DE/DE]; Nattermannallee 1, 50667 Köln 
(DE). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): HAUPTS, Ulrich 
[DE/DE]; c/o Direvo Biotech AG, 50667 Köln 
(DE). KOLTERMANN, Andre [DE/DE]; c/o Direvo 
Biotech AG, 50667 Köln (DE). SCHEIDIG, Andreas 
[DE/DE]; c/o Direvo Biotech AG, 50667 Köln (DE). 
VOETSMEIER, Christian [DE/DE]; c/o Direvo Biotech 
AG, 50667 Köln (DE). KETTLING, Ulrich [DE/DE]; c/o 
Direvo Biotech AG, 50667 Köln (DE). 

(74) Agents: HELBING, Jörg et al.; Von Kreisler Selting 
Werner, Postfach 10 22 41, 50462 Köln (DE). 



(81) Designated States (unless otherwise indicated, for every 
kind of national protection available): AE, AG, AL, AM, 
AT, AU, AZ, BA, BB, BG, BR, BW, BY, BZ, CA, CH, CN, 
CO, CR, CU, CZ, DE, DK, DM, DZ, EC, EE, EG, ES, FI, 
GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, 
KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, MD, 
MG, MK, MN, MW, MX, MZ, NA, NI, NO, NZ, OM, PG, 
PH, PL, PT, RO, RU, SC, SD, SE, SG, SK, SL, SY, TJ, TM, 
TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, YU, ZA, ZM, 
ZW. 

(84) Designated States (unless otherwise indicated, for every 
kind of regional protection available): ARIPO (BW, GH, 
GM, KE, LS, MW, MZ, NA, SD, SL, SZ, TZ, UG, ZM, 
ZW), Eurasian (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European (AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, 
FR, GB, GR, HU, IE, IT, LU, MC, NL, PL, PT, RO, SE, SI, 
SK, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, 
GW, ML, MR, NE, SN, TD, TG). 

Published: 

— with international search report 

— before the expiration of the time limit for amending the 
claims and to be republished in the event of receipt of 
amendments 

— with sequence listing part of description published sepa- 
rately in electronic form and available upon request from 
the International Bureau 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 






(54) Title: NEW BIOLOGICAL ENTITIES AND THE PHARMACEUTICAL OR DIAGNOSTIC USE THEREOF 

(57) Abstract: The present invention provides a method for the treatment of a disease by applying a medicament comprising a 
protease with a defined specificity that is capable of hydrolyzing specific peptide bonds within a target substrate related to such disease. The 
proteases with such a defined specificity can further be used for related therapeutic or diagnostic purposes. 






New Biological Entities and the Pharmaceutical or Diagnostic Use Thereof 

The present invention provides methods for the treatment of a disease by 
applying a medicament comprising a protease with a defined specificity that is 
capable of hydrolyzing specific peptide bonds within a target substrate related to 
such disease. The proteases with such a defined specificity can further be used 
for related therapeutic or diagnostic purposes. 

Background 

Academic and industrial research continuously searches for functional proteins to 
be used as therapeutic, research, diagnostic, nutritional, personal care or 
industrial agents. Today, such functional proteins can be classified mainly into 
two categories: natural proteins and engineered proteins. Natural proteins, on 
the one hand, are discovered from nature, e.g. by screening natural isolates or 
by sequencing genomes from diverse species. Engineered proteins, on the other 
hand, are typically based on known proteins and are altered in order to acquire 
modified functionalities. The present invention discloses engineered proteins with 
novel functions as compared to the starting components. Such proteins are 
called NBEs (New Biologic Entities). The NBEs disclosed in the present invention 
are engineered enzymes with novel substrate specificities or fusion proteins of 
such engineered enzymes with other functional components. 

Specificity is an essential element of enzyme function. A cell consists of 
thousands of different, highly reactive catalysts. Yet the cell is able to maintain a 
coordinated metabolism and a highly organized three-dimensional structure. This 
is due in part to the specificity of enzymes, i.e. the selective conversion of their 
respective substrates. Specificity is a qualitative and a quantitative property: the 
specificity of a particular enzyme can vary widely, ranging from just one 
particular type of target molecules to all molecular types with certain chemical 
substructures. In nature, the specificity of an organism's enzymes has been 
evolved to the particular needs of the organism. Arbitrary specificities with high 
value for therapeutic, research, diagnostic, nutritional or industrial applications 
are unlikely to be found in any organism's enzymatic repertoire due to the large 
space of possible specificities. The only realistic way of obtaining such 
specificities is their generation de novo. 

When comparing enzymes with binders, a paradigm of specificity is given by 
antibodies recognizing individual epitopes as small distinct structures within large 
molecules. The naturally occurring vast range of antibody specificities is 
attributed to the diversity generated by the immune system combined with 
natural selection. Several mechanisms contribute to the vast repertoire of 
antibody specificity and occur at different stages of immune response generation 
and antibody maturation (Janeway, C. et al. (1999) Immunobiology, Elsevier 
Science Ltd., Garland Publishing, New York). Specifically, antibodies contain 
complementarity determining regions (CDRs) which interact with the antigen in a 
highly specific manner and allow discrimination even between very similar 
epitopes. The light as well as the heavy chain of the antibody each contribute 
three CDRs to the binding domain. Nature uses recombination of various gene 
segments combined with further mutagenesis in the generation of CDRs. As a 
result, the sequences of the six CDR loops are highly variable in composition and 
length and this forms the basis for the diversity of binding specificities in 
antibodies. A similar principle for the generation of a diversity of catalytic 
specificities is not known from nature. 

Catalysis, i.e. the increase of the rate of a specific chemical reaction, is, besides 
binding, the most important protein function. Catalytic proteins, i.e. enzymes, are 
classified according to the chemical reaction they catalyze. 

Transferases are enzymes transferring a group, for example, the methyl group or 
a glycosyl group, from one compound (generally regarded as donor) to another 
compound (generally regarded as acceptor). For example, glycosyltransferases 
(EC 2.4) transfer glycosyl residues from a donor to an acceptor molecule. Some 
of the glycosyltransferases also catalyze hydrolysis, which can be regarded as 
transfer of a glycosyl group from the donor to water. The subclass is further 
subdivided into hexosyltransferases (EC 2.4.1), pentosyltransferases (EC 2.4.2) 
and those transferring other glycosyl groups (EC 2.4.99, Nomenclature 
Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB)). 

Oxidoreductases catalyze oxidoreductions. The substrate that is oxidized is 
regarded as hydrogen or electron donor. Oxidoreductases are classified as 
dehydrogenases, oxidases, mono- and dioxygenases. Dehydrogenases transfer 
hydrogen from a hydrogen donor to a hydrogen acceptor molecule. Oxidases 
react with molecular oxygen as hydrogen acceptor and produce oxidized products 
as well as either hydrogen peroxide or water. Monooxygenases transfer one 
oxygen atom from molecular oxygen to the substrate, while the other is reduced to 
water. In contrast, dioxygenases catalyze the insertion of both oxygen atoms from 
molecular oxygen into the substrate. 

Lyases catalyze elimination reactions and thereby generate double bonds or, in 
the reverse direction, catalyze additions to double bonds. Isomerases 
catalyze intramolecular rearrangements. Ligases catalyze the formation of 
chemical bonds at the expense of ATP consumption. 

Finally, hydrolases are enzymes that catalyze the hydrolysis of chemical bonds 
like C-O or C-N. The EC classification for these enzymes generally classifies 
them by the nature of the bond hydrolysed and by the nature of the substrate. 
Hydrolases such as lipases and proteases play an important role in nature as well 
as in technical applications of biocatalysts. Proteases hydrolyse a peptide bond 
within the context of an oligo- or polypeptide. Depending on the catalytic 
mechanism, proteases are grouped into aspartic, serine, cysteine, metallo- and 
threonine proteases (Handbook of proteolytic enzymes. (1998) Eds: Barrett, A.; 
Rawlings, N.; Woessner, J.; Academic Press, London). This classification is based 
on the amino acid side chains that are responsible for catalysis and which are 
typically presented in the active site in very similar orientation to each other. The 
scissile bond of the substrate is brought into register with the catalytic residues 
due to specific interactions between the amino acid side chains of the substrate 
and complementary regions of the protease (Perona, J. & Craik, C (1995) Protein 
Science, 4, 337-360). The residues on the N- and C-terminal side of the scissile 
bond are usually called P1, P2, P3 etc. and P1', P2', P3' etc., and the binding pockets 
complementary to the substrate S1, S2, S3 and S1', S2', S3', respectively 
(nomenclature according to Schechter & Berger, Biochem. Biophys. Res. 
Commun. 27 (1967) 157-162). The selectivity of proteases can vary widely from 
being virtually nonselective - e.g. the subtilisins - over a strict preference at the 
P1 position - e.g. trypsin selectively cutting on the C-terminal side of arginine or 
lysine residues - to highly specific proteases - e.g. human tissue-type 
plasminogen activator (t-PA) cleaving at the C-terminal side of the arginine in 
the sequence CPGRVVG (Ding, L. et al. (1995) Proc. Natl. Acad. Sci. USA 92, 
7627-7631; Coombs, G. et al. (1996) J. Biol. Chem. 271, 4461-4467). 
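
The difference between a P1-only preference and a sequence-specific protease can be illustrated with a minimal Python sketch. It is a simplification for illustration only: the function names, the example peptide and the rule "cut after every arginine or lysine" are assumptions made here, and real proteases are also influenced by neighbouring residues and by substrate structure.

```python
def trypsin_like_sites(peptide: str) -> list[int]:
    """Return each index i where the bond between peptide[i] (P1) and
    peptide[i + 1] (P1') would be cut under a simplified P1 = Arg/Lys rule."""
    # The last residue has no C-terminal peptide bond, so it is skipped.
    return [i for i, residue in enumerate(peptide[:-1]) if residue in "RK"]


def motif_sites(peptide: str, motif: str = "CPGRVVG", p1_offset: int = 3) -> list[int]:
    """Return the P1 index for every occurrence of an extended recognition motif.

    p1_offset is the position of the P1 residue inside the motif
    (the arginine in CPGRVVG sits at offset 3).
    """
    sites = []
    start = peptide.find(motif)
    while start != -1:
        sites.append(start + p1_offset)
        start = peptide.find(motif, start + 1)
    return sites


# Hypothetical substrate containing the CPGRVVG sequence named in the text.
substrate = "AKTCPGRVVGGKLA"
print("P1-only rule cuts after positions:", trypsin_like_sites(substrate))
print("Motif-specific rule cuts after positions:", motif_sites(substrate))
```

In this toy substrate the P1-only rule finds three scissile bonds, while the motif-based rule finds only the bond after the arginine inside CPGRVVG. This is the qualitative contrast the paragraph draws between trypsin and t-PA.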

The specificity of proteases, i.e. their ability to recognize and hydrolyze 
preferentially certain peptide substrates, can be expressed qualitatively and 
quantitatively. Qualitative specificity refers to the kind of amino acid residues 
that are accepted by a protease at certain positions of the peptide substrate. For 
example, trypsin and t-PA are related with respect to their qualitative specificity, 
since both of them require at the P1 position an arginine or a similar residue. On 
the other hand, quantitative specificity refers to the relative number of peptide 
substrates that are accepted as substrates by the protease, or more precisely, to 
the relative kcat/KM ratios of the protease for the different peptides that are 
accepted by the protease. Proteases that accept only a small portion of all 
possible peptides have a high specificity, whereas the specificity of proteases 
that, as an extreme, cleave any peptide substrate would theoretically be zero. 
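
The quantitative notion described above can be sketched numerically. The following Python example is only an illustration under stated assumptions: the kinetic constants are invented, and the function names and the acceptance threshold are hypothetical choices that the text does not define.

```python
def catalytic_efficiency(kcat: float, km: float) -> float:
    """kcat/KM for one substrate (kcat in 1/s, KM in mol/l)."""
    return kcat / km


def relative_efficiencies(constants: dict[str, tuple[float, float]]) -> dict[str, float]:
    """Scale each substrate's kcat/KM to that of the best substrate (best = 1.0)."""
    efficiency = {
        name: catalytic_efficiency(kcat, km) for name, (kcat, km) in constants.items()
    }
    best = max(efficiency.values())
    return {name: value / best for name, value in efficiency.items()}


def accepted_fraction(relative: dict[str, float], threshold: float = 0.01) -> float:
    """Fraction of tested peptides whose relative kcat/KM reaches the threshold.

    A small fraction corresponds to a high quantitative specificity.
    The threshold is an arbitrary assumption for this sketch.
    """
    accepted = sum(1 for value in relative.values() if value >= threshold)
    return accepted / len(relative)


# Invented (kcat, KM) pairs for three hypothetical peptide substrates.
constants = {
    "target": (10.0, 1e-5),
    "off_target_1": (1.0, 1e-3),
    "off_target_2": (0.5, 5e-3),
}
relative = relative_efficiencies(constants)
print(relative)
print("accepted fraction:", accepted_fraction(relative))
```

With these invented numbers only the target peptide passes the threshold, so the protease would count as highly specific in the quantitative sense. A protease that cleaved all three peptides with similar kcat/KM would have an accepted fraction of 1.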

Comparison of the primary, secondary as well as the tertiary structure of 
proteases (Fersht, A., Enzyme Structure and Mechanism, W. H. Freeman and 
Company, New York, 1995) allows identification of classes showing a high degree 
of conservation (Rawlings, N.D. & Barrett, A.J. (1997) In: Proteolysis in Cell 
Functions Eds. Hopsu-Havu, V.K.; Jarvinen, M.; Kirschke, H., pp. 13-21, IOS Press, 
Amsterdam). A widely accepted scheme for protease classification has been 
proposed by Rawlings & Barrett (Handbook of proteolytic enzymes. (1998) Eds: 
Barrett, A.; Rawlings, N.; Woessner, J.; Academic Press, London). For example, the 
serine protease family can be subdivided into structural classes with 
chymotrypsin (class S1), subtilisin (class S8) and carboxypeptidase (class SC) 
folds, each of which includes nonspecific as well as specific proteases (Rawlings, 
N.D. & Barrett, A.J. (1994) Methods Enzymol. 244, 19-61). This applies to other 
protease families analogously. An additional distinction can be made according to 
the relative location of the cleaved bond in the substrate. Carboxy- and 
aminopeptidases cleave amino acids from the C- and N-terminus, respectively, 
while endopeptidases cut anywhere along the oligopeptide. 

Many applications would be conceivable if enzymes with a basically unlimited 
spectrum of specificities were available. However, the use of such enzymes with 
high, low or any defined specificity is currently limited to those which can be 
isolated from natural sources. The field of application for these enzymes varies 
from therapeutic, research, diagnostic, nutritional to personal care and industrial 
purposes. 

Enzyme additives in detergents have come to constitute nearly a third of the 
whole industrial enzyme market. Detergent enzymes include proteinases for 
removing organic stains, lipases for removing greasy stains, amylases for 
removing residues of starchy foods and cellulases for restoring the smooth surface 
of the fiber. The best-known detergent enzyme is probably the nonspecific 
proteinase subtilisin, isolated from various Bacillus species. 

Starch enzymes, such as amylases, make up the majority of the enzymes used in food 
processing. While starch enzymes include products that are important for textile 
desizing, alcohol fermentation, paper and pulp processing, and laundry detergent 
additives, the largest application is for the production of high fructose corn syrup. 
The production of corn syrup from starch by means of industrial enzymes was a 
successful alternative to acid hydrolysis. 

Apart from starch processing, enzymes are used for an increasing range of 
applications in food. Enzymes in food can improve texture, appearance and 
nutritional value or may generate desirable flavours and aromas. Currently used 
food enzymes in bakery are amylases, amyloglycosidases, pentosanases for 
breakdown of pentosan and reduced gluten production or glucose oxidases to 
increase the stability of dough. Common enzymes for dairy are rennet (protease) 
as coagulant in cheese production, lactase for hydrolysis of lactose, protease for 
hydrolysis of whey proteins or catalase for the removal of hydrogen peroxide. 
Enzymes used in the brewing process are the above-named amylases, but also 
cellulases or proteases to clarify the beer from suspended proteins. In wines and 
fruit juices, cloudiness is more commonly caused by starch and pectins so that 
amylases and pectinases increase yield and clarification. Papain and other 
proteinases are used for meat tenderizing. 

Enzymes have also been developed to aid animals in the digestion of feed. In the 
western hemisphere, corn is a major source of food for cattle, swine, and 
poultry. In order to improve the bioavailability of phosphate from corn, phytase 
is commonly added (Wyss, M. et al., Biochemical characterization of fungal 
phytases (myo-inositol hexakisphosphate phosphohydrolases): catalytic 
properties. Applied & Environmental Microbiology 65, 367-373 (1999)). 
Moreover, phytate hydrolysis has been shown to bring about improvements in 
digestibility of protein and absorption of minerals such as calcium (Bedford, M.R. 
& Schulze, H., Exogenous Enzymes for Pigs and Poultry [Review]. Nutrition 
Research Reviews 11, 91-114 (1998)). Another major feed enzyme is xylanase. 
This enzyme is particularly useful as a supplement for feeding stuff comprising 
more than about 10% of wheat, barley or rye, because of their relatively high 
soluble fiber content. Xylanases cause two important actions: reduction of 
viscosity of the intestinal contents by hydrolyzing the gel-like high molecular 
weight arabinoxylans in feed (Murphy, T. et al., Effect of range of new xylanases 
on in vitro viscosity and on performance of broiler diets. British Poultry Science 
44, S16-S18 (2003)) and breakdown of polymers in cell walls, which improves the 
bioavailability of protein and starch. 

Biotech research and development laboratories routinely use special enzymes in 
small quantities along with many other reagents. This use creates a 
significant market for such enzymes. Enzymes like alkaline phosphatase, 
horseradish peroxidase and luciferase are only some examples. Thermostable 
DNA polymerases like Taq polymerase or restriction endonucleases 
revolutionized laboratory work. 

The use of enzymes in the diagnosis of disease is another important benefit 
derived from the intensive research in biochemistry. Within the past few 
years, interest in diagnostic enzymology has increased, and there are still 
large areas of medical research in which the diagnostic potential of enzyme 
reactions has not been explored at all. Common enzymes used for clinical 
diagnosis are acid phosphatase, alanine aminotransferase, alkaline phosphatase, 
amylase, angiotensin converting enzyme, aspartate aminotransferase, 
cholinesterase, creatine kinase, gamma glutamyltransferase, lactate 
dehydrogenase or renin. 

Therapeutic enzymes are a particular class of drugs, categorized by the FDA as 
biologicals, with many advantages compared to other, especially non-biological 
pharmaceuticals. Examples of successful therapeutic enzymes are human 
clotting factors like factor VIII and factor IX for human treatment. In addition, 
digestive enzymes are used for various deficiencies in human digestive 
processes. Other examples are t-PA and streptokinase for the treatment of 
cardiovascular disease, beta-glucocerebrosidase for the treatment of Type I 
Gaucher disease, L-asparaginase for the treatment of acute lymphoblastic 
leukemia and DNAse for the treatment of cystic fibrosis. An important issue in 
the application of proteins as therapeutics is their potential immunogenicity. To 
reduce this risk, one would prefer enzymes of human origin, which narrows down 
the set of available enzymes. The provision of designed enzymes, preferably of 
human origin, with novel, tailor-made specificities would allow the specific 
modification of target substrates at will, while minimizing the risk of 
immunogenicity. A further advantage of highly specific enzymes as therapeutics 
would be their lower risk of side effects. Due to the limited possibility of specific 
interactions between a small molecule and a protein, binding to non-target 
proteins and therefore side effects are quite common and often cause 
termination of an otherwise promising lead compound. Specific enzymes, on the 
other hand, provide many more contact sites and mechanisms for substrate 
discrimination and therefore enable a higher specificity and thereby fewer side 
activities. 

Proteases represent an important class of therapeutic agents (Drugs of today, 
33, 641-648 (1997)). However, currently the therapeutic protease is usually a 
substitute for insufficient activity of the body's own proteases. For example, 
factor VII can be administered in certain cases of coagulation deficiencies of 
bleeders or during surgery (Heuer L; Blumenberg D. (2002) Anaesthesist 
51:388). Tissue-type plasminogen activator (t-PA) is applied in acute cardiac 
infarction, initiating the dissolution of fibrin clots through specific cleavage and 
activation of plasminogen (Verstraete, M. et al. (1995) Drugs, 50, 29-41). So far, 
no protease with tailor-made specificity has been generated to provide a therapeutic 
agent that specifically activates or inactivates a disease related target protein. 

Monoclonal antibodies represent another important biological class of substances 
with therapeutic capabilities. Among the main antibody targets are tumor 
necrosis factors (TNFs), which belong to the family of cytokines. TNFs play a 
major role in the inflammation process. As homotrimers they can bind to 
receptors of nearly every cell. They activate a multiplicity of cellular genes, 
multiple signal transduction mechanisms, kinases and transcription factors. The 
most important TNFs are TNF-alpha and TNF-beta. TNF-alpha is produced by 
macrophages, monocytes and other cells. TNF-alpha is an inflammation 
mediator. Therefore, research of the last decade has been focused on TNF-alpha 
inhibitors like monoclonal antibodies as possible therapeutics for different 
therapeutic indications like Rheumatoid Arthritis, Crohn's disease or Psoriasis 
(Hamilton et al. (2000) Expert Opin Pharmacother, 1 (5): 1041-1052). One of 
the major disadvantages of monoclonal antibodies is their high cost, so that 
new biological alternatives are of great importance. 

There are many examples of engineered enzymes in the literature. Fulani et al. 
(Fulani F. et al. (2003) Protein Engineering 16, 515-519) describe a rhodanese 
(thiosulfate:cyanide sulfurtransferase) from Azotobacter vinelandii which has a 
catalytic domain structurally related to the catalytic subunit of Cdc25 phosphatase 
enzymes. The difference in catalytic mechanism depends on the different size of 
the active site. Both rhodanese and phosphatase are highly specific for different 
substrates (sulfate vs. phosphate). The catalytic mechanism of the rhodanese 
could be shifted towards serine/threonine phosphatase by single-residue 
insertion. Therefore, Fulani et al. give a single example for the change of a 
catalytic mechanism by structural comparison and sequence alignment of 
naturally known enzymes from different enzyme classes but lack an indication of 
how to generate a user-definable substrate specificity while keeping the same 
catalytic mechanism. 

The thioredoxin reductase described by Briggs et al. (WO 02/090300 A2) has an 
altered cofactor specificity and preferentially binds NADPH over NADH. 
Thus, both enzymes, the starting point as well as the resulting engineered 
enzyme, are highly specific towards different substrates. The methods to achieve 
such an altered substrate specificity are either computational processing methods 
or sequence alignments of related proteins to define variable and conserved 
residues. They all have in common that they are based on the comparison of 
structures and sequences of proteins with known specificities followed by the 
transfer of the same to another backbone. 

There are other examples of specificity-engineered enzymes and, in particular, of 
proteases which have been published in the literature. None of these examples, 
however, provides a means for generating novel specificities compared to the 
specificity of the starting material used within the described methods. The 
methods range from structure-directed single point mutations (Kurth, T. et al. 
(1998) Biochemistry 37, 11434-11440; Ballinger, M. et al. (1996) Biochemistry, 
35:13579-13585), exchange of surface loops between two specific proteases 
(Horrevoets et al. (1993) J. Biol. Chem. 268, 779-782), to random mutagenesis 
either regio-selectively or across the whole gene combined with in vitro or in 
vivo selection (Sices, H. & Kristie, T. (1998) Proc. Natl. Acad. Sci. USA, 95, 
2828-2833). 

The rational design of protease specificity is limited to very few examples. This 
approach is severely limited by the insufficient understanding of the complexities 
that govern folding and dynamics as well as structure-function relationships in 
proteins (Corey, M.J. & Corey, E. (1996) Proc. Natl. Acad. Sci. USA, 93:11428- 
11434). It is therefore difficult to alter the primary amino acid sequence of a 
protease in order to change its activity or specificity in a predictive way. In a 
successful example, Kurth et al. engineered trypsin to show a preference for a 
dibasic motif (Kurth, T. et al. (1998) Biochemistry, 37:11434-11440). In 
another example, Hedstrom et al. converted the substrate specificity of 
trypsin to that of chymotrypsin (Hedstrom, L. et al. (1992) Science, 255:1249- 
1253). This is an example where a known property was transferred from one 
backbone to another. 

Ballinger et al. (WO 96/27671) describe subtilisin variants with combination 
mutations (N62D/G166D, and optionally Y104D) having a shift of substrate 
specificity towards peptide or polypeptide substrates with basic amino acids at 
the P1, P2 and P4 positions of the substrate. Suitable substrates of the variant 
subtilisin were revealed by sorting a library of phage particles (substrate phage) 
containing five contiguous randomized residues. These subtilisin variants are 
useful for cleaving fusion proteins with basic substrate linkers and processing 
hormones or other proteins (in vitro or in vivo) that contain basic cleavage sites. 
The problems associated with rational redesign of enzymes can partially be 
overcome by directed evolution (as disclosed in PCT/EP03/04864). These studies 
can be classified by their expression and selection systems. Genetic selection 
means to produce inside an organism an enzyme, e.g. a protease, which is able 
to cleave a precursor protein which in turn results in an alteration of the growth 
behavior of the producing organism. From a population of organisms with 
different proteases those can be selected which have an altered growth behavior. 
This principle was for example reported by Davis et al. (US 5258289, WO 
96/21009). The production of a phage system is dependent on the cleavage of a 
phage protein which only can be activated in the presence of a proteolytic 
enzyme which is able to cleave the phage protein. Other approaches use a 
reporter system which allows a selection by screening instead of a genetic 
selection, but also cannot overcome the intrinsic insufficiency of the intracellular 
characterization of enzymes. 

Systems to generate enzymes with altered sequence specificities with self- 
secreting enzymes are also reported. Duff et al. (WO 98/11237) describe an 
expression system for a self-secreting protease. An essential element of the 
experimental design is that the catalytic reaction acts on the protease itself by an 
autoproteolytic processing of the membrane-bound precursor molecule to release 
the matured protease from the cellular membrane into the extracellular 
environment. Therefore, a fusion protein must be constructed where the target 
peptide sequence replaces the natural cleavage site for autoproteolysis. 
Limitations of such a system are that positively identified proteases will have the 
ability to cleave a certain amino acid sequence but they also may cleave many 
other peptide sequences. Therefore, high substrate specificity cannot be 
achieved. Additionally, such a system is not able to control that selected 
proteases cleave at a specific position in a defined amino acid sequence and it 
does not allow a precise characterization of the kinetic constants of the selected 
proteases (kcat, KM). 

A method has been described that aims at the generation of new catalytic 
activities and specificities within the α/β-barrel proteins (WO 01/42432; Fersht et 
al., Methods of producing novel enzymes; Altamirano et al. (2000) Nature 403, 
617-622). The α/β-barrel proteins comprise a large superfamily of proteins 
accounting for a large fraction of all known enzymes. The structure of the 
proteins is made from an α/β-barrel surrounded by α-helices. The loops connecting 
β-strands and helices comprise the so-called lid structure including the active 
site residues. The method is based on the classification of α/β-barrel proteins 
into two classes based on the catalytic lid structure. An extensive comparison of 
α/β-barrel protein structures led the authors to the conclusion that the substrate 
binding and specificity is primarily defined by the barrel structure while the 
specificity of the chemical reaction resides within the loops. It is suggested that 
barrels and lid structures from different enzymes can be combined to generate 
new enzymatic activities and to provide a starting point to fine tune the 
properties by targeted or randomized mutagenesis and selection. The method 
does not provide for the generation of user-defined specificity. 

In summary, it is clear that there are many possible applications in the fields of 
therapeutics, research and diagnostics, industrial enzymes, food and feed 
processing, cosmetics and other areas that would become possible by the 
availability of enzymes with a novel substrate specificity. However, only a limited 
number of specific enzymes has been identified from natural sources so far. 
Methods of rational design to modify, alter, convert or transfer sequence 
specificity as well as random approaches described above did not enable the 
generation of a novel and user-definable specificity that was not present in the 
employed starting material. 

Therefore, none of the currently available methods can provide enzymes with a 
novel and user-defined sequence specificity. In contrast, the current invention 
provides such enzymes as well as methods for generating them. 

Summary of the Invention 

The objective of the present invention is to provide a method for the treatment 
of a disease by applying a medicament comprising a protease. Further, the 
present invention provides engineered proteins with novel functions that do not 
exist in the components used for the engineering of such proteins. In particular, 
the invention provides enzymes with user-definable specificities. User-definable 
specificity means that enzymes are provided with specificities that do not exist in 
the components used for the engineering of such enzymes. The specificities can 
be chosen by the user so that one or more intended target substrates are 
preferentially recognised and converted by the enzymes. Furthermore, the 
invention provides enzymes that possess essentially identical sequences to 
human proteins but have different specificities. In a particular embodiment, the 
invention provides proteases with user-definable specificities. 

Furthermore, the present invention is directed to engineered enzymes which are 
fused to one or more further functional components. These further components 
can be proteinaceous components which preferably have binding properties and 
are of the group consisting of substrate binding domains, antibodies, receptors or 
fragments thereof. Furthermore, these components can be further 
functional components, preferably being selected from the group consisting of 
polyethylene glycols, carbohydrates, lipids, fatty acids, nucleic acids, metals, metal 
chelates, and fragments or derivatives thereof. The resulting fusion proteins are 
understood as enzymes with user-definable specificities within the present 
invention. 

In addition, the invention is directed to the application of such enzymes with novel,
user-definable specificities for therapeutic, research, diagnostic, nutritional,
personal care or industrial purposes. Moreover, the invention is directed to a
method for generating engineered enzymes with user-definable specificities. In
particular, the invention is directed to generating enzymes that possess essentially
identical sequences to human enzymes but have different specificities.

This problem has been solved by the embodiments of the invention specified in 
the description below and in the claims. The present invention is thus directed to 
(1) the use of a protease with defined specificity for a target substrate for 
preparing a medicament for the treatment of a specific disease related to said 
target substrate,

(2) an engineered enzyme with defined specificity characterized by the
combination of the following components:

(a) a protein scaffold which catalyzes at least one chemical reaction on at least 
one substrate, and 

(b) one or more specificity determining regions (SDRs) located at sites in the 
protein scaffold that enable the resulting engineered protein to discriminate 
between at least one target substrate and one or more different substrates, and 
wherein the SDRs are essentially synthetic peptide sequences; 

(3) the use of an engineered enzyme as defined in (2) above for therapeutic, 
research, diagnostic, nutritional, personal care or industrial purposes, preferably 
for the use as defined in (1) above;

(4) a method for generating engineered enzymes as defined in (2) above having 
specificities towards target substrates, such specificities not being present in the 
individual starting components, comprising at least the following steps: 

(a) providing a protein scaffold which catalyzes at least one chemical reaction on 
at least one substrate, 

(b) generating a library of engineered enzymes by combining the protein scaffold 
from step (a) with fully or partially random peptide sequences at sites in the 
protein scaffold that enable the resulting engineered enzyme to discriminate 
between at least one target substrate and one or more different substrates, and 

(c) selecting out of the library of engineered enzymes generated in step (b) one
or more enzymes that have specificities towards at least one target substrate; 

(5) a fusion protein which is comprised of at least one engineered enzyme as 
defined in (2) above and at least one further component, preferably the at least 
one further component having binding properties and more preferably being 
selected from the group consisting of antibodies, binding domains, receptors, and
fragments thereof; 

(6) a composition or pharmaceutical composition comprising one or more 
engineered enzymes as defined in (2) above or a fusion protein as defined in (5) 
above, said pharmaceutical composition may optionally comprise an acceptable 
carrier, excipient and/or auxiliary agent; 

(7) a DNA encoding the engineered enzyme as defined in (2) above; 

(8) a vector comprising the DNA as defined in (7) above;

(9) a host cell or transgenic organism being transformed/transfected with a
vector as defined in (8) above and/or containing the DNA as defined in (7) 
above; and 

(10) a method for producing the engineered enzyme of (2) above comprising 
culturing a cell or organism as defined in (9) above and isolating the enzyme
from the culture broth. 
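
Steps (a) to (c) of method (4) can be pictured with a small in-silico sketch. It is illustrative only: in practice the library is built at the DNA level and selection is an experimental screen. The toy scaffold string, the SDR site offsets, and the function names `insert_sdrs`, `generate_library` and `select_variants` are assumptions introduced here, and the `score` callable merely stands in for whatever assay measures conversion of the target substrate.

```python
import random
from collections.abc import Callable

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def insert_sdrs(scaffold: str, sdrs: dict[int, str]) -> str:
    """Insert each SDR peptide before the scaffold residue at its 0-based offset."""
    parts, previous = [], 0
    for offset in sorted(sdrs):
        parts.append(scaffold[previous:offset])
        parts.append(sdrs[offset])
        previous = offset
    parts.append(scaffold[previous:])
    return "".join(parts)


def generate_library(
    scaffold: str, site_lengths: dict[int, int], size: int, rng: random.Random
) -> list[str]:
    """Step (b): combine the scaffold with fully random peptides at the SDR sites."""
    if size > len(AMINO_ACIDS) ** sum(site_lengths.values()):
        raise ValueError("requested more variants than distinct sequences exist")
    library: set[str] = set()  # every member has a different sequence
    while len(library) < size:
        sdrs = {
            offset: "".join(rng.choice(AMINO_ACIDS) for _ in range(length))
            for offset, length in site_lengths.items()
        }
        library.add(insert_sdrs(scaffold, sdrs))
    return sorted(library)


def select_variants(
    library: list[str], score: Callable[[str], float], threshold: float
) -> list[str]:
    """Step (c): keep variants whose (hypothetical) assay score meets the threshold."""
    return [variant for variant in library if score(variant) >= threshold]


# Step (a): a toy scaffold, not a real protease sequence.
toy_scaffold = "MKTAYIAKQR"
toy_library = generate_library(toy_scaffold, {3: 4, 7: 5}, 5, random.Random(0))
```

The scaffold residues are left untouched and only the inserted peptides vary, which mirrors the idea that specificity is contributed by the SDRs rather than by the scaffold.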

Brief description of the Figures 

The following figures are provided in order to explain further the present 
invention in supplement to the detailed description: 

Figure 1 illustrates the three-dimensional structure of human trypsin I with the
active site residues shown in "ball-and-stick" representation and with the marked 
regions indicating potential SDR insertion sites. 

Figure 2 shows the alignment of the primary amino acid sequences of three
members of the serine protease class S1 family: human trypsin I, human
alpha-thrombin and human enteropeptidase (see also SEQ ID NOs: 1, 5 and 6).

Figure 3 illustrates the three-dimensional structure of subtilisin with the active 
site residues being shown in "ball-and-stick" representation and with the 
numbered regions indicating potential SDR insertion sites. 

Figure 4 shows the alignment of the primary amino acid sequences of four 
members of the serine protease class S8 family: subtilisin E, furin, PC1 and PC5 
(see also SEQ ID NOs: 7-10). 

Figure 5 illustrates the three-dimensional structure of pepsin with the active site 
residues being shown in "ball-and-stick" representation and with the numbered 
regions indicating potential SDR insertion sites. 

Figure 6 shows the alignment of the primary amino acid sequences of three
members of the A1 aspartic acid protease family: pepsin, β-secretase and
cathepsin D (see also SEQ ID NOs: 11-13).

Figure 7 illustrates the three-dimensional structure of caspase 7 with the active
site residues being shown in "ball-and-stick" representation and with the
numbered regions indicating potential SDR insertion sites.

Figure 8 shows the primary amino acid sequence of caspase 7 as a member of
the cysteine protease class C14 family (see also SEQ ID NO: 14).

Figure 9 depicts schematically the third aspect of the invention. 

Figure 10 shows a Western blot analysis of a culture supernatant of cells 
expressing variants of human trypsin I with SDR1 and SDR2, compared to 
negative controls. 

Figure 11 shows the time course of the proteolytic cleavage of a target substrate
by human trypsin I. 

Figure 12 shows the relative activities of three variants of inventive engineered 
proteolytic enzymes in comparison with human trypsin I on two different peptide 
substrates. 

Figure 13 shows the relative specificities of human trypsin I and variants of 
inventive engineered proteolytic enzymes with one or two SDRs, respectively. 

Figure 14 shows the relative specificities of human trypsin I and of variants of
inventive engineered proteolytic enzymes being specific for human TNF-alpha
with this scaffold on peptides with a target sequence of human TNF-alpha.

Figure 15 shows the reduction of cytotoxicity induced by TNF-alpha when
incubating the TNF-alpha with concentrated supernatant from cultures
expressing the inventive engineered proteolytic enzymes being specific for
human TNF-alpha.

Figure 16 shows the reduction of cytotoxicity induced by TNF-alpha when
incubating the TNF-alpha with purified inventive engineered proteolytic enzyme
being specific for human TNF-alpha.

Figure 17 compares the activity of inventive engineered proteolytic enzymes
being specific for human TNF-alpha with the activity of human trypsin I on two
protein substrates: (a) human TNF-alpha; (b) mixture of human serum proteins.

Figure 18 shows the specific activity of an inventive engineered proteolytic
enzyme with specificity for human VEGF.

Definitions 

In the framework of the present invention the following terms and definitions are 
used. 

The term "protease" means any protein molecule that is capable of hydrolysing 
peptide bonds. This includes naturally-occurring or artificial proteolytic enzymes, 
as well as variants thereof obtained by site-directed or random mutagenesis or 
any other protein engineering method, any active fragment of a proteolytic 
enzyme, or any molecular complex or fusion protein comprising one of the 
aforementioned proteins. A "chimera of proteases" means a fusion protein of two 
or more fragments derived from different parent proteases. 

The term "substrate" means any molecule that can be converted catalytically by 
an enzyme. The term "peptide substrate" means any peptide, oligopeptide, or 
protein molecule of any amino acid composition, sequence or length, that 
contains a peptide bond that can be hydrolyzed catalytically by a protease. The 
peptide bond that is hydrolyzed is referred to as the "cleavage site". Numbering 
of positions in the substrate is done according to the system introduced by 
Schechter & Berger (Biochem. Biophys. Res. Commun. 27 (1967) 157-162).
Amino acid residues adjacent N-terminal to the cleavage site are numbered P1,
P2, P3, etc., whereas residues adjacent C-terminal to the cleavage site are
numbered P1', P2', P3', etc.
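
The numbering convention just described can be sketched in a few lines. The function name `label_positions` and the toy hexapeptide are assumptions made for illustration; only the P/P' labelling rule itself comes from the text above.

```python
def label_positions(sequence: str, cleavage_after: int) -> list[tuple[str, str]]:
    """Label residues relative to the cleavage site.

    `cleavage_after` is the 1-based number of the residue on the N-terminal
    side of the hydrolysed bond, so the bond lies between residues
    `cleavage_after` and `cleavage_after + 1`.
    """
    if not 1 <= cleavage_after < len(sequence):
        raise ValueError("the cleavage site must lie between two residues")
    labels = []
    for number, residue in enumerate(sequence, start=1):
        if number <= cleavage_after:
            # N-terminal side: P1 is directly adjacent to the bond.
            labels.append((f"P{cleavage_after - number + 1}", residue))
        else:
            # C-terminal side: primed positions count away from the bond.
            labels.append((f"P{number - cleavage_after}'", residue))
    return labels


# Toy peptide cleaved between residues 3 and 4.
toy_labels = label_positions("AVRSGT", 3)
```

For the toy peptide this labels A, V, R as P3, P2, P1 and S, G, T as P1', P2', P3'.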

The term "target substrate" describes a user-defined substrate which is 
specifically recognized and converted by an enzyme according to the invention. 
The term "target peptide substrate" describes a user-defined peptide substrate.

The term "target specificity" describes the qualitative and quantitative specificity
of an enzyme that is capable of recognizing and converting a target substrate.
Catalytic properties of enzymes are expressed using the kinetic parameters "Km"
or "Michaelis-Menten constant", "kcat" or "catalytic rate constant", and "kcat/Km"
or "catalytic efficiency", according to the definitions of Michaelis and Menten 
(Fersht, A., Enzyme Structure and Mechanism, W. H. Freeman and Company, 
New York, 1995). The term "catalytic activity" describes quantitatively the 
conversion of a given substrate under defined reaction conditions. 
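
As a reminder of how these parameters relate, the standard Michaelis-Menten rate law gives the initial rate as v = kcat [E] [S] / (Km + [S]). The sketch below evaluates it; the function names and all numerical values are illustrative assumptions, not data from this application.

```python
def michaelis_menten_rate(
    kcat: float, km: float, enzyme_conc: float, substrate_conc: float
) -> float:
    """Initial rate v = kcat * [E] * [S] / (Km + [S])."""
    if km <= 0 or substrate_conc < 0:
        raise ValueError("Km must be positive and [S] non-negative")
    return kcat * enzyme_conc * substrate_conc / (km + substrate_conc)


def catalytic_efficiency(kcat: float, km: float) -> float:
    """Catalytic efficiency kcat/Km."""
    if km <= 0:
        raise ValueError("Km must be positive")
    return kcat / km


# Illustrative values only: kcat in 1/s, concentrations in arbitrary units.
half_max_rate = michaelis_menten_rate(kcat=10.0, km=2.0, enzyme_conc=1.0, substrate_conc=2.0)
efficiency = catalytic_efficiency(kcat=10.0, km=2.0)
```

When [S] equals Km the rate is half of its maximum, kcat [E]. When [S] is far below Km the rate approaches (kcat/Km) [E] [S], which is why the ratio of kcat/Km values on two substrates is commonly used to compare how well an enzyme discriminates between them.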

The term "specificity" means the ability of an enzyme to recognize and convert 
preferentially certain substrates. Specificity can be expressed qualitatively and 
quantitatively. "Qualitative specificity" refers to the chemical nature of the 
substrate residues that are recognized by an enzyme. "Quantitative specificity" 
refers to the number of substrates that are accepted as substrates. Quantitative 
specificity can be expressed by the term s, which is defined as the negative 
logarithm of the number of all accepted substrates divided by the number of all 
possible substrates. Proteases, for example, that preferentially accept a small
portion of all possible peptide substrates have a "high specificity". Proteases that
accept almost any peptide substrate have a "low specificity". Definitions are
made in accordance with WO 03/095670, which is therefore incorporated by
reference. Proteases with very low specificity are also referred to as "unspecific
proteases". The term "defined specificity" refers to a certain type of specificity,
i.e. to a certain target substrate or a set of certain target substrates that are
preferentially converted versus other substrates. 
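
The quantitative specificity s defined above can be computed directly. The text does not state the base of the logarithm, so base 10 is assumed here; the function name and the hexapeptide example are likewise illustrative assumptions.

```python
import math


def quantitative_specificity(accepted: int, possible: int) -> float:
    """s = -log10(accepted / possible); the base-10 logarithm is an assumption."""
    if possible <= 0 or not 0 < accepted <= possible:
        raise ValueError("need 0 < accepted <= possible")
    if accepted == possible:
        return 0.0  # an enzyme accepting every substrate has s = 0
    return -math.log10(accepted / possible)


# Illustrative numbers: all hexapeptides built from 20 amino acids.
all_hexapeptides = 20 ** 6
s_specific = quantitative_specificity(64, all_hexapeptides)
s_unspecific = quantitative_specificity(all_hexapeptides, all_hexapeptides)
```

Under this assumption a protease that accepts 64 of the 64,000,000 possible hexapeptides has s of about 6, whereas an unspecific protease that accepts all of them has s = 0.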

The term "engineered" in combination with the term "enzyme" describes an 
enzyme that is comprised of different components and that has features not 
being conferred by the individual components alone. 

The term "protein scaffold" or "scaffold protein" refers to a variety of primary, 
secondary and tertiary polypeptide structures. 

The term "peptide sequence" indicates any peptide sequence used for insertion 
or substitution into or combination with a protein scaffold. Peptide sequences are 
usually obtained by expression from DNA sequences which can be synthesized
according to well-established techniques or can be obtained from natural
sources. Insertion, substitution or combination of peptide sequences with the 
protein scaffold are generated by insertion, substitution or combination of 
oligonucleotides into or with a polynucleotide encoding the protein scaffold. The 
term "synthetic" in combination with the term "peptide sequence" refers to 
peptide sequences that are not present in the protein scaffold in which the 
peptide sequences are inserted or substituted or with which they are combined. 

The term "components" in combination with the term "engineered enzyme" 
refers to peptide or polypeptide sequences that are combined in the engineering 
of such enzymes. Such components may among others comprise one or more 
protein scaffolds and one or more synthetic peptide sequences. The term "library 
of engineered enzymes" describes a mixture of engineered enzymes, whereby 
every single engineered enzyme is encoded by a different polynucleotide 
sequence. The term "gene library" indicates a library of polynucleotides that 
encodes the library of engineered enzymes. The term "SDR" or "Specificity 
determining region" refers to a synthetic peptide sequence that provides the 
defined specificity when combined with the protein scaffold at sites that enable 
the resulting enzymes to discriminate between the target substrate and one or 
more other substrates. Such sites are termed "SDR sites". 

The terms "tertiary structure similar to the structure of" and "similar tertiary 
structure" in combination with the terms "enzyme" or "protein" refer to proteins 
in which the type, sequence, connectivity and relative orientation of the typical 
secondary structural elements of a protein, e.g. alpha-helices, beta-sheets,
beta-turns and loops, are similar and the proteins are therefore grouped into the
same structural or topological class or fold. This includes proteins that have altered,
additional or deleted structural elements of any type but otherwise unchanged
topology. Examples of such structural classes are the TNF superfamily, the S1
fold or the S8 fold within the serine proteases, the GPCRs, or the α/β-barrel fold.

The term "positions that correspond structurally" indicates amino acids in 
proteins of similar tertiary structure that correspond structurally to each other, 
i.e. they are usually located within the same structural or topological element of 
the structure. Within the structural element they possess the same relative
positions with respect to beginning and end of the structural element. If, e.g. the
topological comparison of two proteins reveals two structurally corresponding 
sequences of different length, then amino acids within, e.g. 20% and 40% of the 
respective region lengths, correspond to each other structurally. 
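
One simple way to read this definition is to map a residue to the residue that sits at the same fractional position in the corresponding region of the other protein. The sketch below does that with integer arithmetic; the function name and the convention of comparing residue midpoints are assumptions made here, and a real comparison would rest on a three-dimensional structural alignment.

```python
def corresponding_position(position: int, length_a: int, length_b: int) -> int:
    """Map a 1-based position in region A to region B by relative location.

    The midpoint of residue `position` in a region of `length_a` residues is
    projected onto a region of `length_b` residues, and the residue of B that
    contains this point is returned (1-based).
    """
    if length_a < 1 or length_b < 1 or not 1 <= position <= length_a:
        raise ValueError("position must lie inside region A")
    # Midpoint fraction is (2 * position - 1) / (2 * length_a); integer
    # arithmetic avoids floating-point rounding at residue boundaries.
    return ((2 * position - 1) * length_b) // (2 * length_a) + 1


# Residue 3 of a 10-residue loop mapped onto a 20-residue loop.
mapped = corresponding_position(3, 10, 20)
```

Regions of equal length map position by position, and the first and last residues of one region always land inside the other region.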

The term "library of engineered enzymes" of the present invention refers to a 
multiplicity of enzymes or enzyme variants, which may exist as a mixture or in 
isolated form. 

Amino acid residues are abbreviated according to the following Table 1, either in
one- or three-letter code.

Table 1: Amino acid abbreviations

Abbreviations     Amino acid
A     Ala         Alanine
C     Cys         Cysteine
D     Asp         Aspartic acid
E     Glu         Glutamic acid
F     Phe         Phenylalanine
G     Gly         Glycine
H     His         Histidine
I     Ile         Isoleucine
K     Lys         Lysine
L     Leu         Leucine
M     Met         Methionine
N     Asn         Asparagine
P     Pro         Proline
Q     Gln         Glutamine
R     Arg         Arginine
S     Ser         Serine
T     Thr         Threonine
V     Val         Valine
W     Trp         Tryptophan
Y     Tyr         Tyrosine

Detailed description of the invention

The present invention provides engineered proteins with novel functions. In 
particular, the invention provides enzymes with user-definable specificities. In a 
particular embodiment, the invention provides proteases with user-definable 
specificities. Besides, the invention provides applications of such enzymes with 
novel, user-definable specificities for therapeutic, research, diagnostic, 
nutritional, personal care or industrial purposes. Moreover, the invention 
provides a method for generating enzymes with specificities that are not present 
in the components used for the engineering of such enzymes. In particular, the 
invention is directed to the generation of enzymes that have sequences that are 
essentially identical to mammalian, especially human enzymes but have different 
specificities. Moreover, the invention provides libraries of specific engineered
enzymes with corresponding specificities encoded genetically, a method for the 
generation of libraries of specific engineered enzymes with corresponding 
specificities encoded genetically, and the application of such libraries for 
technical, diagnostic, nutritional, personal care or research purposes. 

A first aspect of the invention is directed to the application of engineered
enzymes with specificities for therapeutic, research, diagnostic, nutritional, 
personal care or industrial purposes. The application comprises at least the 
following steps: 

(a) identification of a target peptide substrate whose hydrolysis has a positive 
effect in connection with the intended purpose, such as curing a disease, 
diagnosing a disease, processing of ingredients for human or animal 
nutrition, or other technical processes; 

(b) provision of an engineered enzyme, the enzyme being specific for the 
target peptide identified in step (a); and 

(c) use of the enzyme as provided in step (b) for the intended purpose. 

In a first variant of this aspect of the invention, the engineered enzyme is used 
as a therapeutic means to inactivate a disease-related target substrate. This
application comprises at least the following steps: 

(a) identification of a target substrate whose function is connected to a 
disease and whose inactivation has a positive effect in connection with the 
disease, and determination of a target site within the target substrate
characterized by the fact that modification at the target site leads to the
inactivation of the target substrate; 

(b) provision of an engineered enzyme, the enzyme being specific for the 
target site identified in step (a); and 

(c) use of the enzyme for the inactivation of the target substrate inside or 
outside the human body. 

Preferably, the scaffold is a protease and the modification is hydrolysis of a 
target site in a protein target. Preferably, the hydrolysis leads to the activation or 
inactivation of the peptide or protein target. Potential peptide or protein targets 
include soluble proteins, in particular cytokines, such as proteins of the TNF
superfamily, interleukins, interferons, chemokines and growth factors;
hormones; toxins; enzymes, such as oxidoreductases, transferases, hydrolases, 
lyases, isomerases and ligases; structural proteins, such as collagen; and 
immunoglobulins; or membrane associated proteins, in particular single pass 
transmembrane proteins; multipass transmembrane proteins, such as G-protein 
coupled receptors, ion channels and transporters; lipid-anchored membrane 
proteins and GPI-anchored membrane proteins. 

In a first embodiment of this variant the engineered enzyme is a protease and is
capable of hydrolysing human tumor necrosis factor-alpha (hTNF-α). The
enzymes or the fusion protein can thus be used for preparing medicaments for
the treatment of diseases, such as, but not limited to, rheumatoid arthritis,
inflammatory bowel diseases, psoriasis, Crohn's disease, ulcerative colitis,
diabetes type II, classical Hodgkin's Lymphoma (cHL), Graves' disease,
Hashimoto's thyroiditis, Sjögren's syndrome, systemic lupus erythematosus,
multiple sclerosis, systemic inflammatory response syndrome (SIRS) which leads
to distant organ damage and multiple organ dysfunction syndrome (MODS),
eosinophilia, neurodegenerative disease, stroke, closed head injury, encephalitis,
CNS disorders, asthma, rheumatoid arthritis, sepsis, vasodilation, intravascular
coagulation and multiple organ failure, as well as other diseases connected with
hTNF-α. Preferably, said enzyme or said fusion protein is capable of specifically
inactivating hTNF-α (SEQ ID NO: 96). More preferably said enzyme or said fusion
protein is capable of hydrolysing the peptide bonds between positions 31/32,
32/33, 44/45, 45/46, 87/88, 128/129, 130/131, 140/141 and/or 141/142 (most
preferred between positions 31/32, 32/33 and/or 45/46) in hTNF-α, or a peptide
bond in proximity to these positions in hTNF-α, or peptide bonds in protein
targets related to hTNF-α between positions having structural homology or
sequence homology to these positions. In this embodiment it is most preferred
that the protease has the sequence shown in SEQ ID NO: 74 or SEQ ID NO: 75
and is capable of hydrolysing hTNF-α at positions 31/32 and/or 32/33.
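
The "n/n+1" notation used for cleavage positions in this and the following embodiments denotes the peptide bond between residue n and residue n+1 of the target. A minimal sketch of how such a notation translates into fragments is given below. The helper names `parse_bond` and `cleave` and the toy sequence are assumptions for illustration; no sequence from the sequence listing is reproduced here.

```python
def parse_bond(bond: str) -> int:
    """Parse 'n/n+1' and return n, the 1-based residue N-terminal of the bond."""
    n_side_text, c_side_text = bond.split("/")
    n_side, c_side = int(n_side_text), int(c_side_text)
    if c_side != n_side + 1:
        raise ValueError(f"{bond!r} does not name two adjacent residues")
    return n_side


def cleave(sequence: str, bonds: list[str]) -> list[str]:
    """Return the fragments obtained by hydrolysing the given peptide bonds."""
    cut_points = sorted({parse_bond(bond) for bond in bonds})
    for cut in cut_points:
        if not 1 <= cut < len(sequence):
            raise ValueError(f"bond {cut}/{cut + 1} lies outside the sequence")
    fragments, start = [], 0
    for cut in cut_points:
        fragments.append(sequence[start:cut])  # residues start+1 .. cut
        start = cut
    fragments.append(sequence[start:])
    return fragments


# Toy 10-residue sequence cleaved at bonds 3/4 and 7/8.
toy_fragments = cleave("ABCDEFGHIJ", ["3/4", "7/8"])
```

The adjacency check in `parse_bond` also serves as a quick sanity test for position lists, since every valid entry must name two consecutive residues.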

In a second embodiment of this variant the enzyme is a protease and is capable 
of hydrolysing human Tumor necrosis factor ligand superfamily member 5 
(hCD40-L). The enzymes or the fusion protein can thus be used for preparing 
medicaments for the treatment of diseases, such as, but not limited to, systemic 
lupus erythematosus and classical Hodgkin's Lymphoma (cHL), as well as other 
diseases connected with hCD40-L. Preferably, said enzyme or said fusion protein 
is capable of specifically inactivating hCD40-L (SEQ ID NO: 143). More preferably 
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 133/134, 145/146, 165/166, 200/201, 201/202, 207/208 
and/or 216/217 (most preferred between positions 133/134, 165/166, 201/202 
and/or 216/217) in hCD40-L, or a peptide bond in proximity to these positions in 
hCD40-L, or peptide bonds in protein targets related to hCD40-L at positions 
having structural homology or sequence homology to these positions. 

In a third embodiment of this variant the enzyme is a protease and is capable of 
hydrolysing human Macrophage migration inhibitory factor (hMIF). The enzymes 
or the fusion protein can thus be used for preparing medicaments for the 
treatment of diseases, such as, but not limited to, inflammatory diseases, as well 
as other diseases connected with hMIF. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hMIF (SEQ ID NO: 109). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the
peptide bonds between positions 16/17, 44/45, 66/67, 73/74, 77/78, 88/89, 
92/93 and/or 100/101 (most preferred between positions 16/17 and/or 92/93) in 
hMIF, or a peptide bond in proximity to these positions in hMIF, or peptide bonds 
in protein targets related to hMIF at positions having structural homology or 
sequence homology to these positions. 

In a fourth embodiment of this variant the enzyme is a protease and is capable
of hydrolysing human Interleukin-1 beta precursor (hIL-1 beta). The enzymes or
the fusion protein can thus be used for preparing medicaments for the treatment
of diseases, such as, but not limited to, diabetes, brain inflammation in cancer,
arthritis, autoimmune and inflammatory diseases, as well as other diseases
connected with hIL-1 beta. Preferably, said enzyme or said fusion protein is
capable of specifically inactivating hIL-1 beta (SEQ ID NO: 112). More preferably
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds
between positions 24/25, 35/36, 46/47, 54/55, 74/75, 75/76, 76/77, 77/78,
86/87, 88/89, 93/94, 94/95, 97/98 and/or 150/151 (most preferred between
positions 35/36, 75/76, 76/77, 88/89, 93/94, 94/95 and/or 150/151) in hIL-1
beta, or a peptide bond in proximity to these positions in hIL-1 beta, or peptide
bonds in protein targets related to hIL-1 beta at positions having structural
homology or sequence homology to these positions.

In a fifth embodiment of this variant the enzyme is a protease and is capable of
hydrolysing human Interleukin 2 (hIL-2). The enzymes or the fusion protein can
thus be used for preparing medicaments for the treatment of diseases, such as,
but not limited to, T-cell leukemia and hairy cell leukemia, Crohn's disease,
ulcerative colitis, Graves' disease, Hashimoto's thyroiditis, Sjögren's syndrome,
systemic lupus erythematosus, multiple sclerosis, asthma, chronic obstructive
pulmonary disease and classical Hodgkin's Lymphoma (cHL), as well as other
diseases connected with hIL-2. Preferably, said enzyme or said fusion protein is
capable of specifically inactivating hIL-2 (SEQ ID NO: 99). More preferably said
enzyme or said fusion protein is capable of hydrolysing the peptide bonds between
positions 20/21, 32/33, 38/39, 43/44, 45/46, 48/49, 49/50, 54/55, 64/65, 76/77,
83/84, 84/85, 107/108, 109/110 and/or 120/121 (most preferred between positions
109/110) in hIL-2, or a peptide bond in proximity to these positions in hIL-2, or
peptide bonds in protein targets related to hIL-2 at positions having structural
homology or sequence homology to these positions.

In a sixth embodiment of this variant the enzyme is a protease and is capable of
hydrolysing human Interleukin 3 (hIL-3). The enzymes or the fusion protein can
thus be used for preparing medicaments for the treatment of diseases, such as,
but not limited to, classical Hodgkin's Lymphoma (cHL) and eosinophilia, as well
as other diseases connected with hIL-3. Preferably, said enzyme or said fusion
protein is capable of specifically inactivating hIL-3 (SEQ ID NO: 148). More
preferably said enzyme or said fusion protein is capable of hydrolysing the
peptide bonds between positions 21/22, 28/29, 36/37, 44/45, 46/47, 51/52,
63/64, 66/67, 79/80, 94/95, 101/102, 108/109 and/or 109/110 (most preferred
between positions 21/22, 28/29, 46/47, 63/64, 66/67, 79/80 and/or 101/102) in
hIL-3, or a peptide bond in proximity to these positions in hIL-3, or peptide
bonds in protein targets related to hIL-3 at positions having structural homology
or sequence homology to these positions.

In a seventh embodiment of this variant the enzyme is a protease and is capable
of hydrolysing human Interleukin 4 (hIL-4). The enzymes or the fusion protein
can thus be used for preparing medicaments for the treatment of diseases, such
as, but not limited to, classical Hodgkin's Lymphoma (cHL), Graves' disease,
Hashimoto's thyroiditis, Sjögren's syndrome, asthma, chronic obstructive
pulmonary disease and allergic inflammatory reactions, as well as other diseases
connected with hIL-4. Preferably, said enzyme or said fusion protein is capable of
specifically inactivating hIL-4 (SEQ ID NO: 118). More preferably said enzyme or
said fusion protein is capable of hydrolysing the peptide bonds between positions
4/5, 12/13, 31/32, 37/38, 61/62, 62/63, 64/65, 91/92, 102/103, 121/122
and/or 126/127 (most preferred between positions 4/5, 61/62, 62/63, 64/65
and/or 121/122) in hIL-4, or a peptide bond in proximity to these positions in
hIL-4, or peptide bonds in protein targets related to hIL-4 at positions having
structural homology or sequence homology to these positions.

In an eighth embodiment of this variant the enzyme is a protease and is capable
of hydrolysing human Interleukin-5 (hIL-5). The enzymes or the fusion protein
can thus be used for preparing medicaments for the treatment of diseases, such
as, but not limited to, classical Hodgkin's Lymphoma (cHL), asthma, chronic
obstructive pulmonary disease, eosinophilia, allergic inflammatory diseases, as
well as other diseases connected with hIL-5. Preferably, said enzyme or said
fusion protein is capable of specifically inactivating hIL-5 (SEQ ID NO: 133). More
preferably said enzyme or said fusion protein is capable of hydrolysing the
peptide bonds between positions 12/13, 32/33, 67/68, 76/77, 77/78, 80/81,
83/84, 84/85, 85/86, 90/91, 91/92, 92/93 and/or 98/99 (most preferred
between positions 90/91, 91/92, 92/93 and/or 98/99) in hIL-5, or a peptide
bond in proximity to these positions in hIL-5, or peptide bonds in protein targets
related to hIL-5 at positions having structural homology or sequence homology
to these positions.

In a ninth embodiment of this variant the enzyme is a protease and is capable of
hydrolysing human Interleukin-6 (hIL-6). The enzymes or the fusion protein can
thus be used for preparing medicaments for the treatment of diseases, such as,
but not limited to, classical Hodgkin's Lymphoma (cHL), breast cancer, renal cell
carcinoma, multiple myeloma, lymphoma, leukemia, Graves' disease,
Hashimoto's thyroiditis, Sjögren's syndrome, systemic lupus erythematosus,
systemic inflammatory response syndrome (SIRS) which leads to distant organ
damage and multiple organ dysfunction syndrome (MODS), chronic obstructive
pulmonary disease (COPD), Castleman's disease, inflammatory bowel diseases,
Crohn's disease, as well as other diseases connected with hIL-6. Preferably, said
enzyme or said fusion protein is capable of specifically inactivating hIL-6 (SEQ ID
NO: 134). More preferably said enzyme or said fusion protein is capable of
hydrolysing the peptide bonds between positions 32/33, 35/36, 55/56, 71/72,
129/130, 130/131, 132/133, 135/136, 141/142, 161/162, 180/181 and/or
183/184 (most preferred between positions 135/136 and/or 141/142) in hIL-6,
or a peptide bond in proximity to these positions in hIL-6, or peptide bonds in
protein targets related to hIL-6 at positions having structural homology or
sequence homology to these positions.

In a tenth embodiment of this variant the enzyme is a protease and is capable of
hydrolysing human Interleukin 8 (hIL-8). The enzymes or the fusion protein can
thus be used for preparing medicaments for the treatment of diseases, such as,
but not limited to, Crohn's disease, ulcerative colitis, classical Hodgkin's
Lymphoma (cHL), systemic inflammatory response syndrome (SIRS) which leads
to distant organ damage and multiple organ dysfunction syndrome (MODS),
chronic obstructive pulmonary disease (COPD), endometriosis, psoriasis and
atherosclerotic lesions, as well as other diseases connected with hIL-8.
Preferably, said enzyme or said fusion protein is capable of specifically
inactivating hIL-8 (SEQ ID NO: 100). More preferably said enzyme or said fusion
protein is capable of hydrolysing the peptide bonds between positions 11/12,
15/16, 45/46, 47/48, 52/53, 54/55, 60/61, 64/65 and/or 67/68 (most preferred
between positions 45/46) in hIL-8, or a peptide bond in proximity to these
positions in hIL-8, or peptide bonds in protein targets related to hIL-8 at
positions having structural homology or sequence homology to these positions.

In an eleventh embodiment of this variant the enzyme is a protease and is
capable of hydrolysing human Interleukin-10 (hIL-10). The enzymes or the
fusion protein can thus be used for preparing medicaments for the treatment of
diseases, such as, but not limited to, classical Hodgkin's Lymphoma (cHL) and
diseases related to the suppression of cytotoxic T-cells, as well as other diseases
connected with hIL-10. Preferably, said enzyme or said fusion protein is capable
of specifically inactivating hIL-10 (SEQ ID NO: 135). More preferably said enzyme
or said fusion protein is capable of hydrolysing the peptide bonds between
positions 24/25, 25/26, 27/28, 28/29, 40/41, 44/45, 49/50, 57/58, 59/60,
84/85, 86/87, 106/107, 107/108, 110/111, 130/131, 134/135, 137/138,
138/139 and/or 144/145 (most preferred between positions 24/25, 27/28,
44/45, 49/50, 86/87, 137/138 and/or 144/145) in hIL-10, or a peptide bond in
proximity to these positions in hIL-10, or peptide bonds in protein targets related
to hIL-10 at positions having structural homology or sequence homology to these
positions.

In a twelfth embodiment of this variant the enzyme is a protease and is capable
of hydrolysing human Interleukin 12 beta chain (hIL-12B). The enzymes or the
fusion protein can thus be used for preparing medicaments for the treatment of
diseases, such as, but not limited to, Crohn's disease and classical Hodgkin's
Lymphoma (cHL), as well as other diseases connected with hIL-12B. Preferably,
said enzyme or said fusion protein is capable of specifically inactivating hIL-12B
(SEQ ID NO: 97). More preferably said enzyme or said fusion protein is capable of
hydrolysing the peptide bonds between positions 14/15, 18/19, 29/30, 34/35,
87/88, 99/100, 102/103, 104/105, 161/162, 174/175, 222/223, 225/226,
228/229, 238/239, 268/269 and/or 293/294 (most preferred between positions
18/19, 34/35, 87/88 and/or 161/162) in hIL-12B, or a peptide bond in proximity
to these positions in hIL-12B, or peptide bonds in protein targets related to
hIL-12B at positions having structural homology or sequence homology to these
positions.

In a thirteenth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Interleukin 13 (hIL-13). The enzymes or the
fusion protein can thus be used for preparing medicaments for the treatment of
cancer, such as, but not limited to, classical Hodgkin's Lymphoma (cHL),
eosinophilia, asthma, chronic obstructive pulmonary disease, fibrosis, psoriasis,
atopic dermatitis and Ulcerative colitis, as well as other diseases connected with
hIL-13. Preferably, said enzyme or said fusion protein is capable of specifically
inactivating hIL-13 (SEQ ID NO:119). More preferably said enzyme or said fusion
protein is capable of hydrolysing the peptide bonds between positions 25/26,
62/63, 65/66, 86/87, 87/88, 98/99, 108/109 and/or 111/112 (most preferred
between positions 87/88) in hIL-13, or a peptide bond in proximity to these
positions in hIL-13, or peptide bonds in protein targets related to hIL-13 at
positions having structural homology or sequence homology to these positions. 

In a fourteenth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Interleukin 18 (hIL-18). The enzymes or the
fusion protein can thus be used for preparing medicaments for the treatment of
diseases, such as, but not limited to, Crohn's disease, inflammation liver injuries,
pulmonary tuberculosis, pleural tuberculosis and rheumatoid arthritis, as well as
other diseases connected with hIL-18. Preferably, said enzyme or said fusion
protein is capable of specifically inactivating hIL-18 (SEQ ID NO:98). More
preferably said enzyme or said fusion protein is capable of hydrolysing the
peptide bonds between positions 17/18, 32/33, 37/38, 39/40, 40/41, 53/54,
58/59, 79/80, 90/91, 93/94, 98/99, 110/111, 120/121, 123/124, 131/132,
132/133, 142/143, 147/148 and/or 157/158 (most preferred between positions
37/38, 132/133, 142/143 and/or 157/158) in hIL-18, or a peptide bond in
proximity to these positions in hIL-18, or peptide bonds in protein targets related
to hIL-18 at positions having structural homology or sequence homology to these
positions. 

In a fifteenth embodiment of this variant the enzyme is a protease and is capable 
of hydrolysing human Interferon-gamma (hIFN-gamma). The enzymes or the
fusion protein can thus be used for preparing medicaments for the treatment of
diseases, such as, but not limited to, classical Hodgkin's Lymphoma (cHL),
Crohn's disease and type I diabetes, as well as other diseases connected with
hIFN-gamma. Preferably, said enzyme or said fusion protein is capable of
specifically inactivating hIFN-gamma (SEQ ID NO: 137). More preferably said
enzyme or said fusion protein is capable of hydrolysing the peptide bonds
between positions 2/3, 6/7, 13/14, 21/22, 24/25, 34/35, 36/37, 37/38, 62/63,
68/69, 83/84, 86/87, 90/91, 102/103, 107/108 and/or 108/109 (most preferred
between positions 13/14, 24/25, 37/38, 62/63, 68/69, 102/103 and/or 107/108)
in hIFN-gamma, or a peptide bond in proximity to these positions in
hIFN-gamma, or peptide bonds in protein targets related to hIFN-gamma at positions
having structural homology or sequence homology to these positions. 

In a sixteenth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human small inducible cytokine A2 (hCCL2). The enzymes 
or the fusion protein can thus be used for preparing medicaments for the 
treatment of diseases, such as, but not limited to, Crohn's disease and Ulcerative 
colitis, as well as other diseases connected with hCCL2. Preferably, said enzyme 
or said fusion protein is capable of specifically inactivating hCCL2 (SEQ ID 
NO: 102). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 3/4, 13/14, 18/19, 19/20, 
24/25, 29/30, 38/39, 54/55, 56/57, 58/59, 62/63, 65/66 and/or 68/69 (most 
preferred between positions 19/20, 29/30, 38/39, 54/55 and/or 62/63) in 
hCCL2, or a peptide bond in proximity to these positions in hCCL2, or peptide 
bonds in protein targets related to hCCL2 at positions having structural homology 
or sequence homology to these positions. 

In a seventeenth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Eotaxin (hCCL11). The enzymes or the fusion 
protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, Crohn's disease and Ulcerative colitis, 
classical Hodgkin's Lymphoma (cHL), chronic pathophysiologic dysfunction, 
characterized by an influx mainly of Th2 cells, and eosinophilia, as well as other 
diseases connected with hCCL11. Preferably, said enzyme or said fusion protein
is capable of specifically inactivating hCCL11 (SEQ ID NO:101). More preferably
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 11/12, 16/17, 17/18, 22/23, 27/28, 33/34, 44/45, 47/48, 
48/49, 52/53, 54/55, 56/57, 60/61, 66/67 and/or 73/74 (most preferred 
between positions 48/49 and/or 66/67) in hCCL11, or a peptide bond in 
proximity to these positions in hCCL11, or peptide bonds in protein targets 
related to hCCL11 at positions having structural homology or sequence homology
to these positions. 

In an eighteenth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Vascular endothelial growth factor (hVEGF). The 
enzymes or the fusion protein can thus be used for preparing medicaments for
the treatment of diseases, such as, but not limited to, all solid tumors and 
metastatic solid tumors, inflammatory breast cancer, as well as other diseases 
connected with hVEGF. Preferably, said enzyme or said fusion protein is capable 
of specifically inactivating hVEGF (SEQ ID NO: 103). More preferably said enzyme 
or said fusion protein is capable of hydrolysing the peptide bonds between 
positions 16/17, 19/20, 23/24, 34/35, 41/42, 56/57, 62/63, 63/64, 64/65, 
65/66, 82/83, and/or 84/85 (most preferred between positions 23/24, 41/42, 
63/64, 82/83 and/or 84/85) in hVEGF, or a peptide bond in proximity to these 
positions in hVEGF, or peptide bonds in protein targets related to hVEGF at 
positions having structural homology or sequence homology to these positions. 

In a nineteenth embodiment of this variant the enzyme is a protease and is
capable of hydrolysing human Transforming growth factor beta 1 (hTGF-β1). The
enzymes or the fusion protein can thus be used for preparing medicaments for
the treatment of diseases, such as, but not limited to, a variety of cancers,
including breast cancer, colorectal cancer and classical Hodgkin's Lymphoma
(cHL), fibrosis, suppression of cell-mediated immunity, glaucoma, diffuse
systemic sclerosis as well as other diseases connected with hTGF-β1. Preferably,
said enzyme or said fusion protein is capable of specifically inactivating hTGF-β1
(SEQ ID NO:104). More preferably said enzyme or said fusion protein is capable
of hydrolysing the peptide bonds between positions 23/24, 25/26, 26/27, 27/28,
37/38, 55/56 and/or 94/95 (most preferred between positions 25/26, 55/56
and/or 94/95) in hTGF-β1, or a peptide bond in proximity to these positions in
hTGF-β1, or peptide bonds in protein targets related to hTGF-β1 at positions
having structural homology or sequence homology to these positions. 

In a twentieth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Somatotropin (human Growth hormone; hGH). 
The enzymes or the fusion protein can thus be used for preparing medicaments 
for the treatment of diseases, such as, but not limited to, acromegaly, diabetes 
and diabetic kidney disease including renal hypertrophy and glomerular 
enlargement and cardiovascular disorders, as well as other diseases connected 
with hGH. Preferably, said enzyme or said fusion protein is capable of specifically 
inactivating hGH (SEQ ID NO: 121). More preferably said enzyme or said fusion 
protein is capable of hydrolysing the peptide bonds between positions 8/9, 
16/17, 19/20, 26/27, 33/34, 38/39, 41/42, 70/71, 77/78, 94/95, 103/104,
112/113, 115/116, 116/117, 130/131, 147/148, 154/155 and/or 178/179 (most
preferred between positions 112/113, 147/148 and/or 154/155) in hGH, or a
peptide bond in proximity to these positions in hGH, or peptide bonds in protein 
targets related to hGH at positions having structural homology or sequence 
homology to these positions. 

In a twenty-first embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Insulin-like growth factor II (hIGF-II). The
enzymes or the fusion protein can thus be used for preparing medicaments for
the treatment of diseases, such as, but not limited to, diabetes and diabetic
kidney disease, as well as other diseases connected with hIGF-II. Preferably, said
enzyme or said fusion protein is capable of specifically inactivating hIGF-II (SEQ
ID NO: 122). More preferably said enzyme or said fusion protein is capable of
hydrolysing the peptide bonds between positions 15/16, 23/24, 24/25, 34/35,
37/38, 38/39, 48/49 and/or 49/50 (most preferred between positions 23/24) in
hIGF-II, or a peptide bond in proximity to these positions in hIGF-II, or peptide
bonds in protein targets related to hIGF-II at positions having structural
homology or sequence homology to these positions. 

In a twenty-second embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Hepatocyte growth factor (hHGF). The enzymes or 
the fusion protein can thus be used for preparing medicaments for the treatment 
of diseases, such as, but not limited to, angiogenic disorders and hepatocellular 
carcinoma, as well as other diseases connected with hHGF. Preferably, said 
enzyme or said fusion protein is capable of specifically inactivating hHGF (SEQ ID 
NO: 120). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 54/55, 60/61, 62/63, 63/64, 
68/69, 76/77, 112/113, 123/124, 134/135, 168/169, 198/199 and/or 202/203 
(most preferred between positions 63/64, 68/69, 76/77, 168/169 and/or 
202/203) in hHGF, or a peptide bond in proximity to these positions in hHGF, or 
peptide bonds in protein targets related to hHGF at positions having structural 
homology or sequence homology to these positions. 

In a twenty-third embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human hInsulin (hInsulin). The enzymes or the fusion
protein can thus be used for preparing medicaments for the treatment of
diseases, such as, but not limited to, insulin overdosage, as well as other
diseases connected with hInsulin. Preferably, said enzyme or said fusion protein
is capable of specifically inactivating hInsulin B chain (SEQ ID NO: 105) and/or
hInsulin A chain (SEQ ID NO:106). More preferably said enzyme or said fusion
protein is capable of hydrolysing the peptide bonds between positions 16/17
and/or 22/23 in hInsulin B and/or between position 14/15 in Insulin A, or a
peptide bond in proximity to these positions in hInsulin A or B, or peptide bonds
in protein targets related to hInsulin A or B at positions having structural
homology or sequence homology to these positions. 

In a twenty-fourth embodiment of this variant the enzyme is a protease and is
capable of hydrolysing human hGhrelin (hGhrelin). The enzymes or the fusion 
protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, obesity, as well as other diseases 
connected with hGhrelin. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating hGhrelin (SEQ ID NO: 107). More preferably 
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 1/2, 2/3, 3/4 and/or 4/5 in hGhrelin, or a peptide bond in 
proximity to these positions in hGhrelin, or peptide bonds in protein targets 
related to hGhrelin at positions having structural homology or sequence 
homology to these positions. 

In a twenty-fifth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human angiotensinogen (angiotensin). The enzymes or 
the fusion protein can thus be used for preparing medicaments for the treatment 
of diseases, such as, but not limited to, essential hypertension, as well as other 
diseases connected with angiotensin. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating angiotensin (SEQ ID NO: 108). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 1/2, 3/4 and/or 7/8 (most preferred between 
positions 3/4) in angiotensin, or a peptide bond in proximity to these positions in 
angiotensin, or peptide bonds in protein targets related to angiotensin at 
positions having structural homology or sequence homology to these positions. 

In a twenty-sixth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human leptin precursor (leptin). The enzymes or the
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, obesity, as well as other diseases
connected with leptin. Preferably, said enzyme or said fusion protein is capable of 
specifically inactivating leptin (SEQ ID NO:127). More preferably said enzyme or 
said fusion protein is capable of hydrolysing the peptide bonds between positions 
8/9, 9/10, 15/16, 23/24, 40/41, 53/54, 71/72, 85/86, 94/95, 108/109 and/or 
141/142 (most preferred between positions 9/10, 40/41, 71/72, 94/95 and/or 
108/109) in leptin, or a peptide bond in proximity to these positions in leptin, or 
peptide bonds in protein targets related to leptin at positions having structural 
homology or sequence homology to these positions. 

In a twenty-seventh embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing Protective antigen (PA-83). The enzymes or the fusion 
protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, anthrax infection, as well as other diseases 
connected with PA-83. Preferably, said enzyme or said fusion protein is capable 
of specifically inactivating PA-83 (SEQ ID NO: 123). More preferably said enzyme
or said fusion protein is capable of hydrolysing the peptide bonds between
positions 72/73, 73/74, 92/93, 93/94, 131/132, 149/150, 178/179, 213/214, 
214/215, 387/388, 425/426, 426/427, 427/428, 453/454, 520/521, 608/609, 
617/618, 671/672, 679/680, 680/681, 683/684 and/or 684/685 (most preferred 
between positions 72/73, 73/74, 93/94, 149/150, 387/388, 425/426, 427/428 
and/or 683/684) in PA-83, or a peptide bond in proximity to these positions in 
PA-83, or peptide bonds in protein targets related to PA-83 at positions having 
structural homology or sequence homology to these positions. 

In a twenty-eighth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human plasminogen (plasminogen). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, thrombosis, as well as other diseases 
connected with plasminogen. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating plasminogen (SEQ ID NO: 140). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bond between position 580/581 in plasminogen, or a peptide bond in 
proximity to this position in plasminogen, or peptide bonds in protein targets 
related to plasminogen at positions having structural homology or sequence
homology to these positions. 

In a twenty-ninth embodiment of this variant the enzyme is a protease and is
capable of hydrolysing human Prothrombin (thrombin). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, bleeding, as well as other diseases 
connected with thrombin. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating thrombin (SEQ ID NO:149). More preferably 
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 198/199, 327/328 and/or 363/364 (most preferred between
positions 327/328 and/or 363/364) in thrombin, or a peptide bond in proximity 
to these positions in thrombin, or peptide bonds in protein targets related to 
thrombin at positions having structural homology or sequence homology to these 
positions.

In a thirtieth embodiment of this variant the enzyme is a protease and is capable of
hydrolysing human beta-secretase. The enzymes or the fusion protein can thus 
be used for preparing medicaments for the treatment of diseases, such as, but 
not limited to, Alzheimer's disease, as well as other diseases connected with human
beta-secretase precursor. Preferably, said enzyme or said fusion protein is capable of
specifically inactivating human beta-secretase precursor (SEQ ID NO: 139). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 61/62, 64/65, 159/160, 238/239, 239/240, 
246/247, 256/257, 330/331 and/or 365/366 (most preferred between positions 
61/62, 246/247 and/or 365/366) in human beta-secretase precursor, or a 
peptide bond in proximity to these positions in human beta-secretase precursor, 
or peptide bonds in protein targets related to human beta-secretase precursor at 
positions having structural homology or sequence homology to these positions. 

In a thirty-first embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human matrix metalloproteinase-2 (hMMP-2). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, a variety of cancers 
including bladder cancer, breast tumor cancer, gastric cancer and lung cancer, as 
well as other diseases connected with hMMP-2. Preferably, said enzyme or said 
fusion protein is capable of specifically inactivating hMMP-2 (SEQ ID NO: 131). 
More preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 62/63, 68/69, 75/76, 76/77, 79/80, 88/89, 
110/111, 112/113, 115/116, 120/121, 164/165, 254/255, 267/268, 296/297,
324/325, 325/326, 382/383, 383/384, 470/471, 500/501, 550/551, 564/565, 
595/596, 597/598, 608/609, 646/647, 649/650 and/or 650/651 (most preferred 
between positions 68/69, 115/116, 120/121, 164/165, 325/326, 383/384, 
470/471, 500/501, 595/596, 608/609 and/or 650/651) in hMMP-2, or a peptide 
bond in proximity to these positions in hMMP-2, or peptide bonds in protein 
targets related to hMMP-2 at positions having structural homology or sequence 
homology to these positions. 

In a thirty-second embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human matrix metalloproteinase-9 (hMMP-9). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, a variety of cancers 
including bladder cancer, breast tumor cancer, gastric cancer and lung cancer, as 
well as other diseases connected with hMMP-9. Preferably, said enzyme or said 
fusion protein is capable of specifically inactivating hMMP-9 (SEQ ID NO: 132). 
More preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 41/42, 42/43, 106/107, 113/114, 134/135, 
160/161, 162/163, 163/164, 222/223, 226/227, 265/266, 266/267, 267/268, 
284/285, 309/310, 321/322, 322/323, 324/325, 356/357, 380/381, 433/434 
and/or 440/441 (most preferred between positions 160/161, 163/164, 226/227, 
284/285, 321/322, 322/323 and/or 433/434) in hMMP-9, or a peptide bond in 
proximity to these positions in hMMP-9, or peptide bonds in protein targets 
related to hMMP-9 at positions having structural homology or sequence 
homology to these positions. 

In a thirty-third embodiment of this variant the enzyme is a protease and is
capable of hydrolysing HIV membrane glycoprotein (GP120). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, AIDS or HIV infection, as well as other 
diseases connected with GP120 or HIV infection. Preferably, said enzyme or said 
fusion protein is capable of specifically inactivating GP120 (SEQ ID NO: 124).
More preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 97/98, 99/100, 107/108, 113/114, 117/118, 
227/228, 231/233, 279/280, 335/336, 337/338, 368/369, 412/413, 419/420, 
429/430, 444/445, 457/458, 474/475, 476/477, 477/478, 485/486 and/or 
490/491 (most preferred between positions 99/100, 368/369, 412/413,
419/420, 444/445 and/or 490/491) in GP120, or a peptide bond in proximity to 
these positions in GP120, or peptide bonds in protein targets related to GP120 at 
positions having structural homology or sequence homology to these positions. 

In a thirty-fourth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Cytotoxic T-lymphocyte protein 4 (hCTLA-4). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, breast cancer, as well as 
other diseases connected with hCTLA-4. Preferably, said enzyme or said fusion
protein is capable of specifically inactivating hCTLA-4 (SEQ ID NO: 144). More
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 14/15, 28/29, 33/34, 38/39, 41/42, 62/63, 
72/73, 85/86, 95/96, 100/101, 105/106, 119/120, 125/126 and/or 127/128 
(most preferred between positions 14/15, 28/29, 38/39, 41/42, 62/63 and/or 
85/86) in hCTLA-4, or a peptide bond in proximity to these positions in hCTLA-4, 
or peptide bonds in protein targets related to hCTLA-4 at positions having 
structural homology or sequence homology to these positions. 

In a thirty-fifth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Integrin alpha-2 (hVLA-2). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, renal tumors, uveal melanomas and 
gastrointestinal tumors, as well as other diseases connected with hVLA-2. 
Preferably, said enzyme or said fusion protein is capable of specifically 
inactivating hVLA-2 (SEQ ID NO: 147). More preferably said enzyme or said 
fusion protein is capable of hydrolysing the peptide bonds between positions 
160/161, 174/175, 201/202, 219/220, 231/232, 232/233, 233/234, 243/244, 
259/260, 264/265, 268/269, 288/289, 292/293, 294/295, 298/299, 301/302, 
310/311 and/or 317/318 (most preferred between positions 160/161, 174/175, 
201/202, 219/220, 243/244, 264/265, 292/293 and/or 294/295) in hVLA-2, or a 
peptide bond in proximity to these positions in hVLA-2, or peptide bonds in 
protein targets related to hVLA-2 at positions having structural homology or 
sequence homology to these positions. 

In a thirty-sixth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Vascular endothelial growth factor receptor 1 
(hVEGFR 1). The enzymes or the fusion protein can thus be used for preparing
medicaments for the treatment of diseases, such as, but not limited to, solid 
tumors and metastatic solid tumors, astrocytic brain tumors, pancreatic cancer, 
metastatic renal cancer, metastatic solid tumors, as well as other diseases 
connected with hVEGFR 1. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating hVEGFR 1 (SEQ ID NO: 114). More preferably 
said enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 189/190, 190/191, 224/225 and/or 331/332 (most preferred 
between positions 189/190 and/or 331/332) in hVEGFR 1, or a peptide bond in 
proximity to these positions in hVEGFR 1, or peptide bonds in protein targets
related to hVEGFR 1 at positions having structural homology or sequence
homology to these positions. 

In a thirty-seventh embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Vascular endothelial growth factor receptor 2
(hVEGFR 2). The enzymes or the fusion protein can thus be used for preparing
medicaments for the treatment of diseases, such as, but not limited to, solid
tumors and metastatic solid tumors, pancreatic cancer, metastatic renal cancer,
metastatic CRC, as well as other diseases connected with hVEGFR 2. Preferably,
said enzyme or said fusion protein is capable of specifically inactivating hVEGFR
2 (SEQ ID NO: 115). More preferably said enzyme or said fusion protein is
capable of hydrolysing the peptide bonds between positions 214/215 and/or
323/324 (most preferred between position 214/215) in hVEGFR 2, or a peptide
bond in proximity to these positions in hVEGFR 2, or peptide bonds in protein
targets related to hVEGFR 2 at positions having structural homology or sequence 
homology to these positions. 

In a thirty-eighth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Epidermal growth factor receptor (hEGFr). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, bladder cancer, breast
cancer, cervical cancer, colorectal cancer, endometrial cancer, oesophageal
cancer, head and neck cancer, gastric cancer, non-small-cell lung carcinoma and
ovarian cancer, as well as other diseases connected with hEGFr. Preferably, said
enzyme or said fusion protein is capable of specifically inactivating hEGFr (SEQ
ID NO:116). More preferably said enzyme or said fusion protein is capable of
hydrolysing the peptide bonds between positions 20/21, 29/30, 48/49, 74/75,
165/166, 202/203, 220/221, 246/247, 251/252, 269/270, 270/271, 304/305,
305/306, 357/358, 430/431, 443/444, 454/455, 455/456, 463/464, 465/466,
476/477, 507/508 and/or 509/510 in hEGFr, or a peptide bond in proximity to
these positions in hEGFr, or peptide bonds in protein targets related to hEGFr at 
positions having structural homology or sequence homology to these positions. 

In a thirty-ninth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Epithelial cell adhesion molecule (hEp-CAM). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, colorectal cancer, as well 
as other diseases connected with hEp-CAM. Preferably, said enzyme or said 
fusion protein is capable of specifically inactivating hEp-CAM (SEQ ID NO: 125). 
More preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 14/15, 19/20, 25/26, 30/31, 33/34, 55/56 
and/or 70/71 (most preferred between positions 14/15, 30/31 and/or 70/71) in
hEp-CAM, or a peptide bond in proximity to these positions in hEp-CAM, or 
peptide bonds in protein targets related to hEp-CAM at positions having 
structural homology or sequence homology to these positions. 

In a fortieth embodiment of this variant the enzyme is a protease and is capable of
hydrolysing human insulin-like growth factor I receptor (hIGF-1r). The enzymes
or the fusion protein can thus be used for preparing medicaments for the
treatment of diseases, such as, but not limited to, a variety of cancers including
breast cancer, as well as other diseases connected with hIGF-1r. Preferably, said
enzyme or said fusion protein is capable of specifically inactivating hIGF-1r (SEQ
ID NO: 126). More preferably said enzyme or said fusion protein is capable of
hydrolysing the peptide bonds between positions 59/60, 115/116, 146/147,
171/172, 191/192, 290/291, 306/307, 307/308, 335/336, 336/337, 455/456
and/or 470/471 (most preferred between positions 306/307, 307/308, 335/336
and/or 470/471) in hIGF-1r, or a peptide bond in proximity to these positions in
hIGF-1r, or peptide bonds in protein targets related to hIGF-1r at positions
having structural homology or sequence homology to these positions. 

In a forty-first embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human T-cell surface antigen CD2 precursor (hCD2). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, psoriasis, as well as other
diseases connected with hCD2. Preferably, said enzyme or said fusion protein is 
capable of specifically inactivating hCD2 (SEQ ID NO: 128). More preferably said 
enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 42/43, 43/44, 48/49, 49/50, 51/52, 54/55, 63/64, 69/70, 
89/90 and/or 91/92 (most preferred between positions 43/44, 51/52 and/or 
89/90) in hCD2, or a peptide bond in proximity to these positions in hCD2, or 
peptide bonds in protein targets related to hCD2 at positions having structural 
homology or sequence homology to these positions. 

In a forty-second embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human T-cell surface glycoprotein CD4 (hCD4). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, psoriasis, transplant 
rejection, graft-versus-host colitis, autoimmune disorders and rheumatoid 
arthritis, as well as other diseases connected with hCD4. Preferably, said enzyme 
or said fusion protein is capable of specifically inactivating hCD4 (SEQ ID 
NO: 129). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 166/167, 167/168, 206/207, 
219/220, 224/225, 226/227, 251/252, 252/253, 322/323, 329/330 and/or 
334/335 (most preferred between positions 206/207, 219/220, 251/252 and/or 
252/253) in hCD4, or a peptide bond in proximity to these positions in hCD4, or 
peptide bonds in protein targets related to hCD4 at positions having structural 
homology or sequence homology to these positions. 

In a forty-third embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Integrin alpha-L (hCD11a). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, psoriasis as well as other diseases 
connected with hCD11a. Preferably, said enzyme or said fusion protein is capable 
of specifically inactivating hCD11a (SEQ ID NO: 130). More preferably said 
enzyme or said fusion protein is capable of hydrolysing the peptide bonds 
between positions 145/146, 152/153, 156/157, 159/160, 160/161, 177/178, 
178/179, 189/190, 190/191, 191/192, 193/194, 197/198, 200/201, 221/222, 
229/230, 249/250, 253/254, 268/269, 290/291, 297/298, 304/305 and/or 
305/306 (most preferred between positions 145/146, 159/160, 160/161, 
189/190, 229/230, 249/250, 268/269, 297/298, 304/305 and/or 305/306) in 
hCD11a, or a peptide bond in proximity to these positions in hCD11a, or peptide 
bonds in protein targets related to hCD11a at positions having structural 
homology or sequence homology to these positions. 

In a forty-fourth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Interferon-gamma receptor alpha chain 
(hIFN-gamma-R1). The enzymes or the fusion protein can thus be used for preparing 
medicaments for the treatment of diseases, such as, but not limited to, classical 
Hodgkin's Lymphoma (cHL) and type I diabetes, as well as other diseases 
connected with hIFN-gamma-R1. Preferably, said enzyme or said fusion protein 
is capable of specifically inactivating hIFN-gamma-R1 (SEQ ID NO: 136). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 49/50, 52/53, 62/63, 106/107, 122/123, 
174/175, 215/216 and/or 222/223 (most preferred between positions 49/50, 
122/123, 174/175 and/or 215/216) in hIFN-gamma-R1, or a peptide bond in 
proximity to these positions in hIFN-gamma-R1, or peptide bonds in protein 
targets related to hIFN-gamma-R1 at positions having structural homology or 
sequence homology to these positions. 

In a forty-fifth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Platelet membrane glycoprotein IIb/IIIa 
(hGPIIb/IIIa). The enzymes or the fusion protein can thus be used for preparing 
medicaments for the treatment of diseases, such as, but not limited to, unstable 
angina, carotid stenting, ischemic stroke, peripheral vascular diseases, 
angiogenesis-related diseases and disseminating tumors, as well as other 
diseases connected with hGPIIb/IIIa. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hGPIIb/IIIa (SEQ ID NO: 141). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 67/68, 91/92, 129/130, 143/144, 144/145, 
181/182, 208/209, 209/210, 216/217, 239/240, 261/262, 410/411, 532/533, 
556/557, 557/558, 597/598, 650/651 and/or 689/690 (most preferred between 
positions 67/68, 261/262, 410/411, 650/651 and/or 689/690) in hGPIIb/IIIa, or 
a peptide bond in proximity to these positions in hGPIIb/IIIa, or peptide bonds in 
protein targets related to hGPIIb/IIIa at positions having structural homology or 
sequence homology to these positions. 

In a forty-sixth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Intercellular adhesion molecule-1 (hICAM-1). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, Crohn's disease, as well as 
other diseases connected with hICAM-1. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hICAM-1 (SEQ ID NO: 142). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 40/41, 88/89, 97/98, 102/103, 128/129, 
131/132, 132/133, 149/150, 150/151, 160/161 and/or 166/167 (most preferred 
between positions 88/89, 102/103, 150/151, 160/161 and/or 166/167) in 
hICAM-1, or a peptide bond in proximity to these positions in hICAM-1, or 
peptide bonds in protein targets related to hICAM-1 at positions having structural 
homology or sequence homology to these positions. 

In a forty-seventh embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human TGF-beta receptor type II (hTGF-beta RII). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, diffuse systemic sclerosis, 
as well as other diseases connected with hTGF-beta RII. Preferably, said enzyme 
or said fusion protein is capable of specifically inactivating hTGF-beta RII (SEQ ID 
NO: 145). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 32/33, 34/35, 35/36, 66/67, 
67/68, 69/70, 82/83, 103/104, 104/105, 105/106, 118/119, 122/123 and/or 
130/131 (most preferred between positions 32/33, 34/35, 66/67, 69/70, 
104/105, 122/123 and/or 130/131) in hTGF-beta RII, or a peptide bond in 
proximity to these positions in hTGF-beta RII, or peptide bonds in protein targets 
related to hTGF-beta RII at positions having structural homology or sequence 
homology to these positions. 

In a forty-eighth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Membrane cofactor protein (hMCP). The enzymes 
or the fusion protein can thus be used for preparing medicaments for the 
treatment of diseases, such as, but not limited to, renal tumors, uveal 
melanomas and gastrointestinal tumors, as well as other diseases connected with 
hMCP. Preferably, said enzyme or said fusion protein is capable of specifically 
inactivating hMCP (SEQ ID NO: 146). More preferably said enzyme or said fusion 
protein is capable of hydrolysing the peptide bonds between positions 15/16, 
17/18, 25/26, 31/32, 32/33, 35/36, 48/49, 67/68, 69/70, 110/111, 119/120 
and/or 125/126 (most preferred between positions 15/16, 32/33, 48/49, 
119/120 and/or 125/126) in hMCP, or a peptide bond in proximity to 
these positions in hMCP, or peptide bonds in protein targets related to hMCP at 
positions having structural homology or sequence homology to these positions. 

In a forty-ninth embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Protease activated receptor 1 (hPAR1). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, thrombosis, as well as 
other diseases connected with hPAR1. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hPAR1 (SEQ ID NO: 110). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 46/47, 51/52 and/or 52/53 in hPAR1, or a 
peptide bond in proximity to these positions in hPAR1, or peptide bonds in 
protein targets related to hPAR1 at positions having structural homology or 
sequence homology to these positions. 

In a fiftieth embodiment of this variant the enzyme is a protease and is capable of 
hydrolysing human Protease activated receptor 2 (hPAR2). The enzymes or the 
fusion protein can thus be used for preparing medicaments for the treatment of 
diseases, such as, but not limited to, Crohn's disease, Ulcerative colitis and 
Inflammatory bowel disease, asthma, inflammation associated pain and arthritis, 
as well as other diseases connected with hPAR2. Preferably, said enzyme or said 
fusion protein is capable of specifically inactivating hPAR2 (SEQ ID NO:111). 
More preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 41/42, 51/52 and/or 59/60 in hPAR2, or a 
peptide bond in proximity to these positions in hPAR2, or peptide bonds in 
protein targets related to hPAR2 at positions having structural homology or 
sequence homology to these positions. 

In a fifty-first embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human Protease activated receptor 4 (hPAR4). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, thrombosis, as well as 
other diseases connected with hPAR4. Preferably, said enzyme or said fusion 
protein is capable of specifically inactivating hPAR4 (SEQ ID NO: 113). More 
preferably said enzyme or said fusion protein is capable of hydrolysing the 
peptide bonds between positions 68/69, 74/75 and/or 78/79 in hPAR4, or a 
peptide bond in proximity to these positions in hPAR4, or peptide bonds in 
protein targets related to hPAR4 at positions having structural homology or 
sequence homology to these positions. 

In a fifty-second embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human 5-hydroxytryptamine 1A receptor (h5-HT-1A). The 
enzymes or the fusion protein can thus be used for preparing medicaments for 
the treatment of diseases, such as, but not limited to, irritable bowel syndrome, 
as well as other diseases connected with h5-HT-1A. Preferably, said enzyme or 
said fusion protein is capable of specifically inactivating h5-HT-1A (SEQ ID 
NO: 117). More preferably said enzyme or said fusion protein is capable of 
hydrolysing the peptide bonds between positions 101/102, 102/103, 181/182 
and/or 370/371 in h5-HT-1A, or a peptide bond in proximity to these positions in 
h5-HT-1A, or peptide bonds in protein targets related to h5-HT-1A at positions 
having structural homology or sequence homology to these positions. 

In a fifty-third embodiment of this variant the enzyme is a protease and is 
capable of hydrolysing human carcinoembryonic antigen (hCEA). The enzymes or 
the fusion protein can thus be used for preparing medicaments for the treatment 
of diseases, such as, but not limited to, colon cancer, as well as other diseases 
connected with hCEA. Preferably, said enzyme or said fusion protein is capable of 
specifically inactivating hCEA (SEQ ID NO: 138). More preferably said enzyme or 
said fusion protein is capable of hydrolysing the peptide bonds between positions 
17/18, 69/70, 71/72, 74/75, 77/78, 98/99, 116/117, 126/127 and/or 128/129 in 
hCEA, or a peptide bond in proximity to these positions in hCEA, or peptide 
bonds in protein targets related to hCEA at positions having structural homology 
or sequence homology to these positions. 

It is obvious to someone skilled in the art that polymorphisms of all target 
sequences referred to are also included. The expression "proximity to these positions" 
in all embodiments above refers to positions of peptide bonds that are between 10 
and 5 Angstrom and/or 5 amino acids, preferably 3 amino acids, next to the 
positions of the peptide bonds 
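
The sequence-based part of this definition can be illustrated with a short sketch. It assumes, as one possible reading, that "5 amino acids, preferably 3 amino acids, next to" means a shift of up to that many residues along the sequence in either direction; the Angstrom criterion needs a three-dimensional structure and is not covered. The function name `neighbouring_bonds` and its parameters are hypothetical.

```python
def neighbouring_bonds(bond, window=5, seq_len=None):
    """List peptide bonds within `window` residues of a named bond.

    `bond` is a pair (n, n + 1) of 1-based residue numbers, e.g. (51, 52)
    for the bond written "51/52". The named bond itself is not returned.
    Assumption: proximity is counted as a shift along the sequence.
    """
    n, c = bond
    if c != n + 1:
        raise ValueError("a peptide bond joins consecutive residues")
    found = []
    for shift in range(-window, window + 1):
        start = n + shift
        if shift == 0 or start < 1:
            continue  # skip the named bond and positions before residue 1
        if seq_len is not None and start + 1 > seq_len:
            continue  # skip positions past the end of the sequence
        found.append((start, start + 1))
    return found


# Bonds within 3 residues of bond 51/52 (illustrative input)
print(neighbouring_bonds((51, 52), window=3))
```

With the default window of 5 the function returns up to ten neighbouring bonds; with the preferred window of 3 it returns up to six.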

Preferably, in this variant the scaffold of the engineered enzyme provided in step 
(c) is of human origin in order to avoid or reduce immunogenicity or allergenic 
effects associated with the application of the enzyme in the human body. 
Alternatively, immunogenicity and allergenicity can be reduced by deimmunization 
of the engineered enzyme. 

In a further embodiment of this variant, the target substrate is a pro-drug which is 
activated by the engineered enzyme. In a particular embodiment of this variant, 
the engineered enzyme has proteolytic activity and the target substrate is a 
protein target which is proteolytically activated. Examples of such pro-drugs are 
pro-proteins such as the inactivated forms of coagulation factors. In another 
particular variant, the engineered enzyme is an oxidoreductase and the target 
substrate is a chemical that can be activated by oxidation. 

In a second variant of this aspect of the invention, the engineered enzyme is 
used for diagnostic purposes. In a particular embodiment of this variant, the 
engineered enzyme is a target-specific protease. Such diagnostic purposes 
comprise but are not limited to applications with the aim of diagnosing diseases, 
testing genetic predispositions or monitoring disease progression during therapy. 
In a particular embodiment, the diagnosis is based on the testing for the 
presence or absence of a disease-specific marker protein or a disease-specific 
variant of a human protein in test samples such as human tissue samples, blood 
samples or other samples taken from patients. The testing employs a protease 
with specificity for a particular, disease-related target protein. The testing is done 
by analysing the proteolytic degradation of such protein in the test sample. 
In a preferred embodiment the aim of the diagnostic test is to detect and/or 
quantify a disease-specific variant of a native human protein. Such a diagnostic 
test employs a protease that is specific for the disease-related protein variant, 
i.e. it has significantly higher proteolytic activity on the disease-related protein 
variant compared to the native human protein. The disease-related protein 
variant is therefore detected and/or quantified by detecting and/or quantifying 
the activity of the target-specific protease. Such detection and/or quantification 
is done by directly measuring the degradation products of the target protein or 
indirectly by measuring the influence of the target protein on the activity of the 
target-specific protease by a competition assay. In another preferred 
embodiment the aim of the diagnostic test is to detect and/or quantify a protein 
that is specific for an infection by an infectious agent such as a virus or a 
bacterium. Such a diagnostic test employs a protease that is specific for a protein 
specifically expressed upon infection by the infectious agent, i.e. it has 
significantly higher proteolytic activity on a particular infection-indicating protein 
compared to any other native human protein. The infection-indicating protein is 
therefore detected and/or quantified by detecting and/or quantifying the 
proteolytic activity of the target-specific protease. Such detection and/or 
quantification is done by directly measuring the degradation products of the 
infection-indicating protein or indirectly by measuring the influence of the 
infection-indicating protein on the activity of the target-specific protease by a 
competition assay. 

In a third variant of this aspect of the invention, the engineered enzyme is used 
as a technical means in order to catalyze an industrially or nutritionally relevant 
reaction with defined specificity. In a particular embodiment of this variant the 
engineered enzyme has proteolytic activity, the catalyzed reaction is a proteolytic 
processing, and the engineered enzyme specifically hydrolyses one or more 
industrially or nutritionally relevant protein substrates. In a preferred embodiment 
of this variant the engineered enzyme hydrolyses one or more industrially or 
nutritionally relevant protein substrates at specific sites, thereby leading to 
industrially or nutritionally desired product properties such as texture, taste or 
precipitation characteristics. In a further particular embodiment of this variant, 
the engineered enzyme catalyzes the hydrolysis of glycosidic bonds (glycosidase 
or glycosylase activity). Then, preferably, the catalyzed reaction is a 
polysaccharide processing, and the engineered enzyme specifically hydrolyses 
one or more industrially, technically or nutritionally relevant polysaccharide 
substrates. In a further particular embodiment of this variant, the engineered 
enzyme catalyzes the hydrolysis of triglyceride esters or lipids (lipase activity). 
Then, preferably, the catalyzed reaction is a lipid processing step, and the 
engineered enzyme specifically hydrolyses one or more industrially, technically or 
nutritionally relevant lipid substrates. In a further particular variant of this 
embodiment, the engineered enzyme catalyzes the oxidation or reduction of 
substrates (oxidoreductase activity). Then, preferably, the engineered enzyme 
specifically oxidizes or reduces one or more industrially, technically or nutritionally 
relevant chemical substrates. 

A second aspect of the invention discloses engineered enzymes with defined 
specificities. These engineered enzymes are characterized by the following 
components: 

(a) a protein scaffold capable of catalyzing at least one chemical reaction on a 
substrate, and 

(b) one or more specificity determining regions (SDRs) located at sites in the 
protein scaffold that enable the resulting engineered protein to discriminate 
between at least one target substrate and one or more different substrates, 
wherein the SDRs are essentially synthetic peptide sequences. 

Preferably, such defined specificity of the engineered enzymes is not conferred 
by the protein scaffold. 

In principle, the protein scaffold can have a variety of primary, secondary and 
tertiary structures. The primary structure, i.e. the amino acid sequence, can be 
an engineered sequence or can be derived from any viral, prokaryotic or 
eukaryotic origin. For human therapeutic use, however, the protein scaffold is 
preferably of mammalian origin, and more preferably, of human origin. 
Furthermore, the protein scaffold is capable of catalyzing one or more chemical 
reactions and has preferably only a low specificity. 

Preferably, derivatives of the protein scaffold are used that have modified amino 
acid sequences that confer improved characteristics for the applicability as 
protein scaffolds. Such improved characteristics comprise, but are not limited to, 
stability; expression or secretion yield; folding, in particular after combination of 
the protein scaffold with SDRs; increased or decreased sensitivity to regulators 
such as activators or inhibitors; immunogenicity; catalytic rate; Km or substrate 
affinity. 

The engineered enzymes reveal their quantitative specificity from the synthetic 
peptide sequences that are combined with the protein scaffold. Therefore, the 
engineered peptide sequences are acting as Specificity Determining Regions or 
SDRs. The number, the length and the positions of such SDRs can vary over a 
wide range. The number of SDRs within the scaffold is at least one, preferably 
more than one, more preferably between two and eleven, most preferably 
between two and six. The SDRs have a length between one and 50 amino acid 
residues, preferably a length between one and 15 amino acid residues, more 
preferably a length between one and six amino acid residues. Alternatively, the 
SDRs have a length between two and 20 amino acid residues, preferably a length 
between two and ten amino acid residues, more preferably a length between 
three and eight amino acid residues. 

The inventive engineered enzymes can further be described as antibody-like 
protein molecules comprising constant and variable regions, but having a 
non-immunoglobulin backbone and having an active site (catalytic activity) in the 
constant region, whereby the substrate specificity of the active site is modulated 
by the variable region. Preferably, as in the immunoglobulin structure, the 
variable regions are loops of variable length and composition that interact with a 
target molecule. 

In a particular variant of the invention, the engineered enzymes have hydrolase 
activity. In a preferred variant, the engineered enzymes have proteolytic activity. 
Particularly preferred protein scaffolds for this variant are unspecific proteases or 
are parts from unspecific proteases or are otherwise derived from unspecific 
proteases. The expressions "derived from" or "a derivative thereof" in this 
respect and in the following variants and embodiments refer to derivatives of 
proteins that are mutated at one or more amino acid positions and/or have a 
homology of at least 70%, preferably 90%, more preferably 95% and most 
preferably 99% to the original protein, and/or that are proteolytically processed, 
and/or that have an altered glycosylation pattern, and/or that are covalently 
linked to non-protein substances, and/or that are fused with further protein 
domains, and/or that have C-terminal and/or N-terminal truncations, and/or that 
have specific insertions, substitutions and/or deletions. Alternatively, "derived 
from" may refer to derivatives that are combinations or chimeras of two or more 
fragments from two or more proteins, each of which optionally comprises any or 
all of the aforementioned modifications. The tertiary structure of the protein 
scaffold can be of any type. Preferably, however, the tertiary structure belongs to 
one of the following structural classes: class S1 (chymotrypsin fold of the serine 
proteases family), class S8 (subtilisin fold of the serine proteases family), class 
SC (carboxypeptidase fold of the serine proteases family), class A1 (pepsin A fold 
of the aspartic proteases), or class C14 (caspase-1 fold of the cysteine 
proteases). Examples of proteases that can serve as the protein scaffold of 
engineered proteolytic enzymes for the use as human therapeutics are or are 
derived from human trypsin, human thrombin, human chymotrypsin, human 
pepsin, human endothiapepsin, human caspases 1 to 14, and/or human furin. 

The defined specificity of the engineered proteolytic enzymes is a measure of 
their ability to discriminate between at least one target peptide or protein 
substrate and one or more further peptide or protein substrates. Preferably, the 
defined specificity refers to the ability to discriminate peptide or protein 
substrates that differ in other positions than the P1 site, more preferably, the 
defined specificity refers to the ability to discriminate peptide or protein 
substrates that differ in other positions than the P1 site and the P1' site. Most 
preferably, the engineered proteolytic enzymes distinguish target peptide or 
protein substrates at as many sites as is necessary to preferentially hydrolyse 
the target substrate versus other proteins. As an example, a therapeutically 
useful engineered proteolytic enzyme applied intravenously in the human body 
should be sufficiently specific to discriminate between the target substrate and 
any other protein in the human serum. Preferably, such an engineered 
proteolytic enzyme recognizes and discriminates peptide substrates at three or 
more amino acid positions, more preferably at four or more positions, and even 
more preferably at five or more amino acid positions. These positions may either 
be adjacent or non-adjacent. 
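
As a rough illustration of what discriminating substrates "at three or more amino acid positions" outside P1 and P1' involves, the following sketch labels the residues around a scissile bond in the conventional P/P' notation and lists the positions at which two substrates differ. The helper names and the example peptides are hypothetical and are not taken from the sequence listing.

```python
def label_residues(n_side, c_side):
    """Map P/P' labels to residues around a scissile bond.

    `n_side` holds the residues N-terminal of the bond (ending at P1),
    `c_side` the residues C-terminal of it (starting at P1').
    """
    labels = [f"P{i}" for i in range(len(n_side), 0, -1)]
    labels += [f"P{i}'" for i in range(1, len(c_side) + 1)]
    return dict(zip(labels, n_side + c_side))


def discriminating_positions(target, other, ignore=("P1", "P1'")):
    """Return labels where two substrates differ, skipping `ignore`."""
    a, b = label_residues(*target), label_residues(*other)
    if a.keys() != b.keys():
        raise ValueError("substrates must cover the same positions")
    return [pos for pos in a if pos not in ignore and a[pos] != b[pos]]


# Hypothetical substrates written as (N-terminal side, C-terminal side)
target = ("AGPR", "SLVK")
other = ("TEPR", "SLTK")
differences = discriminating_positions(target, other)
print(differences, len(differences) >= 3)
```

An enzyme that cleaves only the first of these two hypothetical substrates would have to read at least some of the listed positions, since P1 and P1' are identical in both.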

In a first embodiment, the protein scaffold has a tertiary structure or fold equal 
or similar to the tertiary structure or fold of the S1 structural subclass of serine 
proteases, i.e. the chymotrypsin fold, and/or has at least 70% identity on the 
amino acid level to a protein of the S1 structural subclass of serine proteases. It 
is preferred that SDRs are inserted into the protein scaffold at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 18-25, 38-48, 54-63, 73-86, 122-130, 
148-156, 165-171 and 194-204 in human trypsin I, and more preferably at one 
or more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 20-23, 41-45, 57-60, 76-83, 
125-128, 150-153, 167-169 and 197-201 (numbering of amino acids according to 
SEQ ID NO:1). The number of SDRs to be combined with this type of protein 
scaffold is preferably between 1 and 10, and more preferably between 2 and 4. 
Preferably, the protein scaffold is equal to or is a derivative or homologue of one 
or more of the following proteins: chymotrypsin, granzyme, kallikrein, trypsin, 
mesotrypsin, neutrophil elastase, pancreatic elastase, enteropeptidase, 
cathepsin, thrombin, ancrod, coagulation factor IXa, coagulation factor VIIa, 
coagulation factor Xa, activated protein C, urokinase, tissue-type plasminogen 
activator, plasmin, Desmodus-type plasminogen activator. More preferably, the 
protein scaffold is trypsin or thrombin or is a derivative or homologue from 
trypsin or thrombin. For the use as a human therapeutic, the trypsin or thrombin 
scaffold is most preferably of human origin in order to minimize the risk of an 
immune response or an allergenic reaction. 
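
The two sets of regions named for the S1 scaffold can be held in a small lookup table. The sketch below only restates the residue ranges given above (numbering according to SEQ ID NO:1) and classifies a residue number against them; the function name, the return labels and the queried positions are illustrative assumptions.

```python
# Regions in human trypsin I named for SDR insertion (inclusive ranges)
BROAD_REGIONS = [(18, 25), (38, 48), (54, 63), (73, 86),
                 (122, 130), (148, 156), (165, 171), (194, 204)]
# Narrower, more preferred regions
PREFERRED_REGIONS = [(20, 23), (41, 45), (57, 60), (76, 83),
                     (125, 128), (150, 153), (167, 169), (197, 201)]


def classify_position(residue):
    """Return 'preferred', 'broad' or 'outside' for a residue number."""
    if any(lo <= residue <= hi for lo, hi in PREFERRED_REGIONS):
        return "preferred"
    if any(lo <= residue <= hi for lo, hi in BROAD_REGIONS):
        return "broad"
    return "outside"


for residue in (42, 123, 100):
    print(residue, classify_position(residue))
```

For a homologous scaffold the residue numbers would first have to be mapped onto the trypsin numbering by a structural or sequence alignment, which this sketch does not attempt.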

Preferably, derivatives with improved characteristics derived from human trypsin 
I or from proteins with similar tertiary structure are used. Preferred examples of 
such derivatives are derived from human trypsin I (SEQ ID NO:1) and comprise 
one or more of the following amino acid substitutions: E56G; R78W; Y131F; 
A146T; C183R. 

It is preferred that at least one of two SDRs is inserted into human trypsin I, or 
a derivative thereof, between residues 42 and 43 (SDR 1) and between 123 and 
124 (SDR 2), respectively (numbering of amino acids according to SEQ ID NO: 1). 
In addition the SDR 1 has a preferred length of 6 and the SDR 2 has a preferred 
length of 5 amino acids, respectively. In a preferred variant of this embodiment, 
the SDR 1 and SDR 2 sequences comprise one of the amino acid sequences listed 
in table 2. Such engineered proteolytic enzymes have specificity for the target 
substrate B as exemplified in example IV. 
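
A minimal sketch of the insertion step described above, assuming the scaffold is handled as a plain one-letter amino acid string. The scaffold and SDR sequences below are placeholders and are neither SEQ ID NO:1 nor entries from table 2; the function name `insert_sdrs` is hypothetical.

```python
def insert_sdrs(scaffold, insertions):
    """Insert peptides into a scaffold sequence.

    `insertions` maps a 1-based residue number to the peptide placed
    directly after that residue, so key 42 means "between 42 and 43".
    Insertions are applied from the highest position downwards so that
    earlier insertions do not shift the numbering of later ones.
    """
    result = scaffold
    for after in sorted(insertions, reverse=True):
        if not 1 <= after < len(scaffold):
            raise ValueError(f"position {after} is not inside the scaffold")
        result = result[:after] + insertions[after] + result[after:]
    return result


scaffold = "G" * 150   # placeholder scaffold, not the trypsin sequence
sdr1 = "ACDEFS"        # placeholder SDR 1, length 6
sdr2 = "HIKLM"         # placeholder SDR 2, length 5
engineered = insert_sdrs(scaffold, {42: sdr1, 123: sdr2})
print(len(engineered))  # scaffold length plus 11 inserted residues
```

Keying the insertions by scaffold numbering keeps the positions stated in terms of the unmodified sequence, which matches how the insertion sites are given in the text.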

In a further embodiment the protein scaffold belongs to the S8 structural 
subclass of serine proteases and/or has a tertiary structure similar to subtilisin E 
from Bacillus subtilis and/or has at least 70% identity on the amino acid level to 
a protein of the S8 structural subclass of serine proteases. Preferably, the 
scaffold belongs to the subtilisin family or the human pro-protein convertases. It 
is preferred that SDRs are inserted into the protein scaffold at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 6-17, 25-29, 47-55, 59-69, 101-111, 
117-125, 129-137, 139-154, 158-169, 185-195 and 204-225 in subtilisin E from 
Bacillus subtilis, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 59-69, 101-111, 129-137, 158-169 and 204-225 (numbering of amino 
acids according to SEQ ID NO: 7). It is preferred that the protein scaffold is equal 
to or is a derivative or homologue of one or more of the following proteins: 
subtilisin Carlsberg; B. subtilis subtilisin E; subtilisin BPN'; B. licheniformis 
subtilisin; B. lentus subtilisin; Bacillus alcalophilus alkaline protease; proteinase 
K; kexin; human pro-protein convertase; human furin. In a preferred variant, 
subtilisin BPN' or one of the proteins SPC 1 to 7 is used as the protein scaffold. 

In a further embodiment the protein scaffold belongs to the family of aspartic 
proteases and/or has a tertiary structure similar to human pepsin. Preferably, the 
scaffold belongs to the A1 class of proteases and/or has at least 70% identity on 
the amino acid level to a protein of the A1 class of proteases. It is preferred that 
SDRs are inserted into the protein scaffold at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 6-18, 49-55, 74-83, 91-97, 112-120, 126-137, 159- 
164, 184-194, 242-247, 262-267 and 277-300 in human pepsin, and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 10-15, 75-80, 
114-118, 130-134, 186-191 and 280-296 (numbering of amino acids according 
to SEQ ID NO: 11). It is preferred that the protein scaffold is equal to or is a 
derivative or homologue of one or more of the following proteins: pepsin, 
chymosin, renin, cathepsin, yapsin. Preferably, pepsin or endothiapepsin or a 
derivative or homologue thereof is used as the protein scaffold. 

In a further embodiment the protein scaffold belongs to the cysteine protease 
family and/or has a tertiary structure similar to human caspase 7. Preferably the 
scaffold belongs to the C14 class of cysteine proteases or has at least 70% 
identity on the amino acid level to a protein of the C14 class of cysteine 
proteases. It is preferred that SDRs are inserted into the protein scaffold at one 
or more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 78-91, 144-160, 186-198, 226- 
243 and 271-291 in human caspase 7, and more preferably at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 80-86, 149-157, 190-194 and 233-238 
(numbering of amino acids according to SEQ ID NO:14). It is preferred that the 
protein scaffold is equal to or is a derivative or homologue of one of the caspases 
1 to 9. 

In a further embodiment the protein scaffold belongs to the S11 class of serine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
S11 class of serine proteases and/or has a tertiary structure similar to 
D-alanyl-D-alanine transpeptidase from Streptomyces species K15. It is preferred that 
SDRs are inserted into the protein scaffold at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 67-79, 137-150, 191-206, 212-222 and 241-251 in 
D-alanyl-D-alanine transpeptidase from Streptomyces species K15, and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 70-75, 141-147, 
195-202 and 216-220 (numbering of amino acids according to SEQ ID NO:15). It 
is preferred that the D-alanyl-D-alanine transpeptidase from Streptomyces 
species K15 or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the S21 class of serine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
S21 class of serine proteases and/or has a tertiary structure similar to assemblin 
from human cytomegalovirus. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 25- 
33, 64-69, 134-155, 162-169 and 217-244 in assemblin from human 
cytomegalovirus, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 27-31, 164-168 and 222-239 (numbering of amino acids according to 
SEQ ID NO:16). It is preferred that the assemblin from human cytomegalovirus 
or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the S26 class of serine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
S26 class of serine proteases and/or has a tertiary structure similar to the signal 
peptidase from Escherichia coli. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 8-14, 
57-68, 125-134, 239-254, 200-211 and 228-239 in signal peptidase from 
Escherichia coli, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 9-13, 60-67, 127-132 and 203-209 (numbering of amino acids according 
to SEQ ID NO: 17). It is preferred that the signal peptidase from Escherichia coli 
or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the S33 class of serine
proteases or has at least 70% identity on the amino acid level to a protein of the 
S33 class of serine proteases and/or has a tertiary structure similar to the prolyl 
aminopeptidase from Serratia marcescens. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 47- 
54, 152-160, 203-212 and 297-302 in prolyl aminopeptidase from Serratia 
marcescens, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 50-53, 154-158 and 206-210 (numbering of amino acids according to 
SEQ ID NO: 18). It is preferred that the prolyl aminopeptidase from Serratia 
marcescens or a derivative or homologue thereof is used as the scaffold. 
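
The 70% identity criterion used throughout these embodiments can be illustrated with a minimal Python sketch. It assumes that the two sequences have already been aligned (gaps written as "-") and that identity is counted over the aligned columns in which neither sequence has a gap. The text does not fix a particular alignment method or identity definition, so both choices, as well as the function names percent_identity and meets_identity_threshold, are assumptions made for illustration only.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap.

    Assumption: the inputs are two rows of a pairwise alignment of equal
    length, with '-' marking gaps. Other identity definitions exist.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns containing a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 70.0) -> bool:
    """True if the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

With this definition, a candidate scaffold would be compared against a member of the relevant class and retained if meets_identity_threshold returns True. A different gap treatment would give different percentages for the same pair.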

In a further embodiment the protein scaffold belongs to the S51 class of serine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
S51 class of serine proteases and/or has a tertiary structure similar to aspartyl 
dipeptidase from Escherichia coli. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 8-16, 
38-46, 85-92, 132-140, 159-170 and 205-211 in aspartyl dipeptidase from 
Escherichia coli, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the
regions 10-14, 87-90, 134-138 and 160-165 (numbering of amino acids 
according to SEQ ID NO: 19). It is preferred that the aspartyl dipeptidase from 
Escherichia coli or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the A2 class of aspartic 
proteases or has at least 70% identity on the amino acid level to a protein of the 
A2 class of aspartic proteases and/or has a tertiary structure similar to the 
protease from human immunodeficiency virus. It is preferred that SDRs are 
inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 5-12, 17-23, 27-30, 33-38 and 77-83 in protease from human 
immunodeficiency virus, and more preferably at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 7-10, 18-21, 34-37 and 79-82 (numbering of amino 
acids according to SEQ ID NO: 20). It is preferred that the protease from human 
immunodeficiency virus, preferably HIV-1 protease, or a derivative or homologue 
thereof is used as the scaffold. 
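
The relationship between the broader and the more preferred insertion regions can be sketched in Python, using the A2 class regions stated above as example data. The sketch assumes that positions are already numbered according to the reference sequence (here SEQ ID NO: 20); the helper names parse_regions, contains, nested and classify are hypothetical and introduced only for illustration.

```python
import re

def parse_regions(text):
    """Parse a list such as '5-12, 17-23 and 77-83' into (start, end) tuples."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\s*-\s*(\d+)", text)]

def contains(regions, pos):
    """True if the residue position lies inside any region (inclusive bounds)."""
    return any(start <= pos <= end for start, end in regions)

def nested(inner, outer):
    """True if every inner region lies completely inside some outer region."""
    return all(
        any(o_start <= start and end <= o_end for o_start, o_end in outer)
        for start, end in inner
    )

# Regions quoted from the A2 class embodiment above.
BROAD = parse_regions("5-12, 17-23, 27-30, 33-38 and 77-83")
PREFERRED = parse_regions("7-10, 18-21, 34-37 and 79-82")

def classify(pos):
    """Label a reference position as 'preferred', 'allowed' or 'outside'."""
    if contains(PREFERRED, pos):
        return "preferred"
    if contains(BROAD, pos):
        return "allowed"
    return "outside"
```

For these data nested(PREFERRED, BROAD) holds, so each more preferred region is a sub-range of one of the broader regions. The same check can be applied to the region lists of the other embodiments.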

In a further embodiment the protein scaffold belongs to the A26 class of
aspartic proteases or has at least 70% identity on the amino acid level to a 
protein of the A26 class of aspartic proteases and/or has a tertiary structure 
similar to the omptin from Escherichia coli. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 28- 
40, 86-98, 150-168, 213-219 and 267-278 in omptin from Escherichia coli, and
more preferably at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 33- 
38, 161-168 and 273-277 (numbering of amino acids according to SEQ ID 
NO:21). It is preferred that the omptin from Escherichia coli or a derivative or
homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C1 class of cysteine
proteases or has at least 70% identity on the amino acid level to a protein of the 
C1 class of cysteine proteases and/or has a tertiary structure similar to the 
papain from Carica papaya. It is preferred that SDRs are inserted into the protein
scaffold at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 17-24, 61-68, 
88-95, 135-142, 153-158 and 176-184 in papain from Carica papaya, and more
preferably at one or more positions from the group of positions that correspond
structurally or by amino acid sequence homology to the regions 63-66, 136-139 
and 177-181 (numbering of amino acids according to SEQ ID NO: 22). It is 
preferred that the papain from Carica papaya or a derivative or homologue 
thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C2 class of cysteine 
proteases or has at least 70% identity on the amino acid level to a protein of the
C2 class of cysteine proteases and/or has a tertiary structure similar to human 
calpain-2. It is preferred that SDRs are inserted into the protein scaffold at one 
or more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 90-103, 160-172, 193-199, 243- 
260, 286-294 and 316-322 in human calpain-2, and more preferably at one or 
more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 92-101, 245-250 and 287-291 
(numbering of amino acids according to SEQ ID NO:23). It is preferred that the 
human calpain-2 or a derivative or homologue thereof is used as the scaffold. 
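
Where a homologue rather than the reference protein itself is used, the positions that "correspond by amino acid sequence homology" can be derived from a pairwise alignment. The following Python sketch shows one way to do this. It assumes a precomputed alignment with "-" as the gap character, and the function name map_positions is hypothetical. Structural correspondence, the other option named in the text, would require a structure superposition and is not covered here.

```python
def map_positions(aligned_ref: str, aligned_query: str) -> dict:
    """Map 1-based reference positions to 1-based query positions.

    Assumption: both strings are rows of the same pairwise alignment.
    Reference residues aligned to a gap in the query get no entry.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    mapping = {}
    ref_pos = 0
    query_pos = 0
    for r, q in zip(aligned_ref, aligned_query):
        if r != "-":
            ref_pos += 1
        if q != "-":
            query_pos += 1
        if r != "-" and q != "-":
            mapping[ref_pos] = query_pos
    return mapping


def map_region(mapping: dict, start: int, end: int) -> list:
    """Query positions corresponding to reference positions start..end."""
    return [mapping[p] for p in range(start, end + 1) if p in mapping]
```

A region given in reference numbering, for example 92-101 in human calpain-2, would then be translated with map_region(mapping, 92, 101) once an alignment between SEQ ID NO:23 and the homologue is available.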

In a further embodiment the protein scaffold belongs to the C4 class of cysteine 
proteases or has at least 70% identity on the amino acid level to a protein of the
C4 class of cysteine proteases and/or has a tertiary structure similar to NIa
protease from tobacco etch virus. It is preferred that SDRs are inserted into the
protein scaffold at one or more positions from the group of positions that
correspond structurally or by amino acid sequence homology to the regions 23-
31, 112-120, 144-150, 168-176 and 205-218 in NIa protease from tobacco etch
virus, and more preferably at one or more positions from the group of positions
that correspond structurally or by amino acid sequence homology to the regions
145-149, 169-174 and 212-218 (numbering of amino acids according to SEQ ID
NO: 24). It is preferred that the NIa protease from tobacco etch virus (TEV
protease) or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C10 class of cysteine
proteases or has at least 70% identity on the amino acid level to a protein of the 
C10 class of cysteine proteases and/or has a tertiary structure similar to the 
streptopain from Streptococcus pyogenes. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 81- 
90, 133-140, 150-164, 191-199, 219-229, 246-256, 306-312 and 330-337 in 
streptopain from Streptococcus pyogenes, and more preferably at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 82-87, 134-138, 250-254 and 331-335 
(numbering of amino acids according to SEQ ID NO:25). It is preferred that the 
streptopain from Streptococcus pyogenes or a derivative or homologue thereof is 
used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C19 class of cysteine
proteases or has at least 70% identity on the amino acid level to a protein of the 
C19 class of cysteine proteases and/or has a tertiary structure similar to human 
ubiquitin specific protease 7. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 3-15, 
63-70, 80-86, 248-256, 272-283 and 292-304 in human ubiquitin specific 
protease 7, and more preferably at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 10-15, 251-255, 277-281 and 298-304 (numbering of amino acids 
according to SEQ ID NO: 26). It is preferred that the human ubiquitin specific 
protease 7 or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C47 class of cysteine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
C47 class of cysteine proteases and/or has a tertiary structure similar to the 
staphopain from Staphylococcus aureus. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 15-
23, 57-66, 108-119, 142-149 and 157-164 in staphopain from Staphylococcus
aureus, and more preferably at one or more positions from the group of positions 
that correspond structurally or by amino acid sequence homology to the regions
17-22, 111-117, 143-147 and 159-163 (numbering of amino acids according to 
SEQ ID NO:27). It is preferred that the staphopain from Staphylococcus aureus 
or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C48 class of
cysteine proteases or has at least 70% identity on the amino acid level to a 
protein of the C48 class of cysteine proteases and/or has a tertiary structure 
similar to the Ulp1 endopeptidase from Saccharomyces cerevisiae. It is preferred 
that SDRs are inserted into the protein scaffold at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 40-51, 108-115, 132-141, 173-179 and 597-605 in 
Ulp1 endopeptidase from Saccharomyces cerevisiae, and more preferably at one 
or more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 43-49, 110-113, 133-137 and 
175-178 (numbering of amino acids according to SEQ ID NO:28). It is preferred 
that the Ulp1 endopeptidase from Saccharomyces cerevisiae or a derivative or 
homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the C56 class of cysteine 
proteases or has at least 70% identity on the amino acid level to a protein of the 
C56 class of cysteine proteases and/or has a tertiary structure similar to the PfpI
endopeptidase from Pyrococcus horikoshii. It is preferred that SDRs are inserted
into the protein scaffold at one or more positions from the group of positions that
correspond structurally or by amino acid sequence homology to the regions 8-16,
40-47, 66-73, 118-125 and 147-153 in PfpI endopeptidase from Pyrococcus
horikoshii, and more preferably at one or more positions from the group of
positions that correspond structurally or by amino acid sequence homology to the
regions 9-14, 68-71, 120-123 and 148-151 (numbering of amino acids according
to SEQ ID NO: 29). It is preferred that the PfpI endopeptidase from Pyrococcus
horikoshii or a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the M4 class of metallo 
proteases or has at least 70% identity on the amino acid level to a protein of the 
M4 class of metallo proteases and/or has a tertiary structure similar to 
thermolysin from Bacillus thermoproteolyticus. It is preferred that SDRs are
inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 106-118, 125-130, 152-160, 197-204, 210-213 and 221-229 in 
thermolysin from Bacillus thermoproteolyticus, and more preferably at one or
more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 108-115, 126-129, 199-203 and 
223-227 (numbering of amino acids according to SEQ ID NO:30). It is preferred 
that the thermolysin from Bacillus thermoproteolyticus or a derivative or
homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the M10 class of metallo
proteases or has at least 70% identity on the amino acid level to a protein of the 
M10 class of metallo proteases and/or has a tertiary structure similar to human 
collagenase. It is preferred that SDRs are inserted into the protein scaffold at one 
or more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 2-7, 68-79, 85-90, 107-111 and 
135-141 in human collagenase, and more preferably at one or more positions 
from the group of positions that correspond structurally or by amino acid 
sequence homology to the regions 3-6, 71-78 and 136-140 (numbering of amino 
acids according to SEQ ID NO:31). It is preferred that human collagenase or a 
derivative or homologue thereof is used as the scaffold. 

It is further preferred that the engineered enzymes have glycosidase activity. A 
particularly suited protein scaffold for this variant is a glycosylase or is derived 
from a glycosylase. Preferably, the tertiary structure belongs to one of the 
following structural classes: class GH13, GH7, GH12, GH11, GH10, GH28, GH26, 
and GH18 (beta/alpha)8 barrel. 

In a first embodiment the protein scaffold belongs to the GH13 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH13 class of glycosylases and/or has a tertiary structure similar to human 
pancreatic alpha-amylase. It is preferred that SDRs are inserted into the protein
scaffold at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 50-60, 100-110, 
148-167, 235-244, 302-310 and 346-359 in human pancreatic alpha-amylase,
and more preferably at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 51- 
58, 148-155 and 303-309 (numbering of amino acids according to SEQ ID 
NO:32). It is preferred that human pancreatic alpha-amylase or a derivative or
homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the GH7 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH7 class of glycosylases and/or has a tertiary structure similar to cellulase 
from Trichoderma reesei. It is preferred that SDRs are inserted into the protein 
scaffold at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 47-56, 93-104, 
173-182, 215-223, 229-236 and 322-334 in cellulase from Trichoderma reesei, 
and more preferably at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 175-
180, 218-222 and 324-332 (numbering of amino acids according to SEQ ID
NO:33). It is preferred that cellulase from Trichoderma reesei or a derivative or 
homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the GH12 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH12 class of glycosylases and/or has a tertiary structure similar to cellulase 
from Aspergillus niger. It is preferred that SDRs are inserted into the protein 
scaffold at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 18-28, 55-60, 
106-113, 126-132 and 149-159 in cellulase from Aspergillus niger, and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 20-26, 56-59, 
108-112 and 151-156 (numbering of amino acids according to SEQ ID NO:34). It
is preferred that cellulase from Aspergillus niger or a derivative or homologue 
thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the GH11 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH11 class of glycosylases and/or has a tertiary structure similar to xylanase
from Aspergillus niger. It is preferred that SDRs are inserted into the protein
scaffold at one or more positions from the group of positions that correspond
structurally or by amino acid sequence homology to the regions 7-14, 33-39, 88-
97, 114-126 and 158-167 in xylanase from Aspergillus niger, and more
preferably at one or more positions from the group of positions that correspond
structurally or by amino acid sequence homology to the regions 20-26, 56-59,
108-112 and 151-156 (numbering of amino acids according to SEQ ID NO: 35). It
is preferred that xylanase from Aspergillus niger or a derivative or homologue 
thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the GH10 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH10 class of glycosylases and/or has a tertiary structure similar to xylanase 
from Streptomyces lividans. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 21- 
29, 42-50, 84-92, 130-136, 206-217 and 269-278 in xylanase from 
Streptomyces lividans, and more preferably at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 43-49, 86-90, 208-213 and 271-276 (numbering of 
amino acids according to SEQ ID NO: 36). It is preferred that xylanase from 
Streptomyces lividans or a derivative or homologue thereof is used as the 
scaffold. 

In a further embodiment the protein scaffold belongs to the GH28 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH28 class of glycosylases and/or has a tertiary structure similar to 
pectinase from Aspergillus niger. It is preferred that SDRs are inserted into the
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 82- 
88, 118-126, 171-178, 228-236, 256-264 and 289-299 in pectinase from 
Aspergillus niger, and more preferably at one or more positions from the group 
of positions that correspond structurally or by amino acid sequence homology to 
the regions 116-124, 174-178 and 291-296 (numbering of amino acids according 
to SEQ ID NO:37). It is preferred that pectinase from Aspergillus niger or a
derivative or homologue thereof is used as the scaffold.

In a further embodiment the protein scaffold belongs to the GH26 class of 
glycosylases or has at least 70% identity on the amino acid level to a protein of 
the GH26 class of glycosylases and/or has a tertiary structure similar to 
mannanase from Pseudomonas cellulosa. It is preferred that SDRs are inserted
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 75- 
83, 113-125, 174-182, 217-224, 247-254, 324-332 and 325-340 in mannanase 
from Pseudomonas cellulosa, and more preferably at one or more positions from 
the group of positions that correspond structurally or by amino acid sequence 
homology to the regions 115-123, 176-180, 286-291 and 328-337 (numbering of
amino acids according to SEQ ID NO:38). It is preferred that mannanase from 
Pseudomonas cellulosa or a derivative or homologue thereof is used as the
scaffold. 

In a further embodiment the protein scaffold belongs to the GH18 (beta/alpha)8
barrel class of glycosylases or has at least 70% identity on the amino acid level 
to a protein of the GH18 class of glycosylases and/or has a tertiary structure 
similar to chitinase from Bacillus circulans. It is preferred that SDRs are inserted
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 21- 
29, 57-65, 130-136, 176-183, 221-229, 249-257 and 327-337 in chitinase from 
Bacillus circulans, and more preferably at one or more positions from the group 
of positions that correspond structurally or by amino acid sequence homology to 
the regions 59-63, 178-181, 250-254 and 330-336 (numbering of amino acids 
according to SEQ ID NO:39). It is preferred that chitinase from Bacillus circulans 
or a derivative or homologue thereof is used as the scaffold.

It is further preferred that the engineered enzymes have esterhydrolase activity. 
Preferably, the protein scaffold for this variant has lipase, phosphatase,
phytase, or phosphodiesterase activity. 



In a first embodiment the protein scaffold belongs to the GX class of esterases or 
has at least 70% identity on the amino acid level to a protein of the GX class of 
esterases and/or has a tertiary structure similar to the structure of the lipase B 
from Candida antarctica. Preferably, the scaffold has lipase activity. It is 
preferred that SDRs are inserted into the protein scaffold at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 139-148, 188-195, 216-224, 256-266, 
272-287 in lipase B from Candida antarctica, and more preferably at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 141-146, 218-222, 259-263 and 275-283 
(numbering of amino acids according to SEQ ID NO:40). It is preferred that 
lipase B from Candida antarctica or a derivative or homologue thereof is used as 
the scaffold. 

In a further embodiment the protein scaffold belongs to the GX class of esterases 
or has at least 70% identity on the amino acid level to a protein of the GX class 
of esterases and/or has a tertiary structure similar to the pancreatic lipase from 
guinea pig. Preferably, the scaffold has lipase activity. It is preferred that SDRs 
are inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 78-90, 91-100, 112-120, 179-186, 207-218, 238-247 and 248-260 in 
pancreatic lipase from guinea pig, and more preferably at one or more positions 
from the group of positions that correspond structurally or by amino acid 
sequence homology to the regions 80-87, 114-118, 209-215 and 239-246 
(numbering of amino acids according to SEQ ID NO:41). It is preferred that 
pancreatic lipase from guinea pig or a derivative or homologue thereof is used as 
the scaffold. 

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the alkaline phosphatase from Escherichia coli or has at least 
70% identity on the amino acid level to a protein that has a tertiary structure 
similar to the structure of the alkaline phosphatase from Escherichia coli. 
Preferably, the scaffold has phosphatase activity. It is preferred that SDRs are 
inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 110-122, 187-142, 170-175, 186-193, 280-287 and 425-435 in alkaline
phosphatase from Escherichia coli, and more preferably at one or more positions 
from the group of positions that correspond structurally or by amino acid 
sequence homology to the regions 171-174, 187-191, 282-286 and 426-433 
(numbering of amino acids according to SEQ ID NO:42). It is preferred that 
alkaline phosphatase from Escherichia coli or a derivative or homologue thereof 
is used as the scaffold. 

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the bovine pancreatic desoxyribonuclease I or has at least 70% 
identity on the amino acid level to a protein that has a tertiary structure similar 
to the structure of the bovine pancreatic desoxyribonuclease I. Preferably, the 
scaffold has phosphodiesterase activity. More preferably, a nuclease, and most 
preferably, an unspecific endonuclease or a derivative thereof is used as the 
scaffold. It is preferred that SDRs are inserted into the protein scaffold at one or 
more positions from the group of positions that correspond structurally or by 
amino acid sequence homology to the regions 14-21, 41-47, 72-77, 97-111, 
135-143, 171-178, 202-209 and 242-251 in bovine pancreatic 
desoxyribonuclease I, and more preferably at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 16-19, 42-46, 136-141 and 172-176 (numbering of 
amino acids according to SEQ ID NO:43). It is preferred that bovine pancreatic 
desoxyribonuclease I or human desoxyribonuclease I or a derivative or 
homologue thereof is used as the scaffold. 

It is further preferred that the engineered enzyme has transferase activity. A 
particularly suited protein scaffold for this variant is a glycosyl-, a phospho- or a 
methyltransferase, or is a derivative thereof. Particularly preferred protein 
scaffolds for this variant are glycosyltransferases or are derived from 
glycosyltransferases. The tertiary structure of the protein scaffold can be of any 
type. Preferably, however, the tertiary structure belongs to one of the following 
structural classes: GH13 and GT1.

In a first embodiment the protein scaffold belongs to the GH13 class of 
transferases or has at least 70% identity on the amino acid level to a protein of
the GH13 class of transferases and/or has a tertiary structure similar to the 
structure of the cyclomaltodextrin glucanotransferase from Bacillus circulans. 
Preferably, the scaffold has transferase activity, and more preferably a 
glycosyltransferase is used as the scaffold. It is preferred that SDRs are inserted 
into the protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 38- 
48, 85-94, 142-154, 178-186, 259-266, 331-340 and 367-377 in 
cyclomaltodextrin glucanotransferase from Bacillus circulans, and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 87-92, 180-185, 
261-264 and 269-275 (numbering of amino acids according to SEQ ID NO:44). It 
is preferred that cyclomaltodextrin glucanotransferase from Bacillus circulans or 
a derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold belongs to the GT1 class of 
transferases or has at least 70% identity on the amino acid level to a protein of
the GT1 class of transferases and/or has a tertiary structure similar to the 
structure of the glycosyltransferase from Amycolatopsis orientalis A82846. 
Preferably the scaffold has transferase activity, and more preferably 
glycosyltransferase activity. It is preferred that SDRs are inserted into the 
protein scaffold at one or more positions from the group of positions that 
correspond structurally or by amino acid sequence homology to the regions 58- 
74, 130-138, 185-193, 228-236 and 314-323 in glycosyltransferase from 
Amycolatopsis orientalis A82846, and more preferably at one or more positions 
from the group of positions that correspond structurally or by amino acid 
sequence homology to the regions 61-71, 230-234 and 316-321 (numbering of 
amino acids according to SEQ ID NO:45). It is preferred that the 
glycosyltransferase from Amycolatopsis orientalis A82846 or a derivative or 
homologue thereof is used as the scaffold. 

It is further preferred that the engineered enzymes have oxidoreductase activity. 
A particularly suited protein scaffold for this variant is a monooxygenase, a 
dioxygenase or an alcohol dehydrogenase, or a derivative thereof. The tertiary
structure of the protein scaffold can be of any type. 

In a first embodiment the protein scaffold has a tertiary structure similar to the
structure of the 2,3-dihydroxybiphenyl dioxygenase from Pseudomonas sp. or
has at least 70% identity on the amino acid level to a protein that has a tertiary
structure similar to the structure of the 2,3-dihydroxybiphenyl dioxygenase
from Pseudomonas sp. Preferably, the scaffold has dioxygenase activity. It is
preferred that SDRs are inserted into the protein scaffold at one or more
positions from the group of positions that correspond structurally or by amino
acid sequence homology to the regions 172-185, 198-206, 231-237, 250-259
and 282-287 in 2,3-dihydroxybiphenyl dioxygenase from Pseudomonas sp., and
more preferably at one or more positions from the group of positions that
correspond structurally or by amino acid sequence homology to the regions 175-
182, 200-204, 252-257 and 284-287 (numbering of amino acids according to
SEQ ID NO: 46). It is preferred that the 2,3-dihydroxybiphenyl dioxygenase
from Pseudomonas sp. or a derivative or homologue thereof is used as the
scaffold. 

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the catechol dioxygenase from Acinetobacter sp. or has at least 
70% identity on the amino acid level to a protein that has a tertiary structure
similar to the structure of the catechol dioxygenase from Acinetobacter sp.
Preferably, the scaffold has dioxygenase activity, and more preferably catechol
dioxygenase activity. It is preferred that SDRs are inserted into the protein
scaffold at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 66-72, 105-112, 
156-171 and 198-207 in catechol dioxygenase from Acinetobacter sp., and more 
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 107-110, 161- 
171 and 201-205 (numbering of amino acids according to SEQ ID NO:47). It is 
preferred that the catechol dioxygenase from Acinetobacter sp. or a derivative or
homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the camphor-5-monooxygenase from Pseudomonas putida or has 
at least 70% identity on the amino acid level to a protein that has a tertiary 
structure similar to the structure of the camphor-5-monooxygenase from 
Pseudomonas putida. Preferably, the scaffold has monooxygenase activity, and
more preferably camphor monooxygenase activity. It is preferred that SDRs are 
inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 26-31, 57-63, 84-98, 182-191, 242-256, 292-299 and 392-399 in 
camphor-5-monooxygenase from Pseudomonas putida, and more preferably at 
one or more positions from the group of positions that correspond structurally or 
by amino acid sequence homology to the regions 85-96, 183-188, 244-253, 293- 
298 and 393-398 (numbering of amino acids according to SEQ ID NO: 48). It is 
preferred that the camphor-5-monooxygenase from Pseudomonas putida or a 
derivative or homologue thereof is used as the scaffold. 

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the alcohol dehydrogenase from Equus caballus or has at least
70% identity on the amino acid level to a protein that has a tertiary structure
similar to the structure of the alcohol dehydrogenase from Equus caballus.
Preferably, the scaffold has alcohol dehydrogenase activity. It is preferred that 
SDRs are inserted into the protein scaffold at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 49-63, 111-112, 294-301 and 361-369 in alcohol 
dehydrogenase from Equus caballus, and more preferably at one or more
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 51-61 and 295-299 (numbering of amino 
acids according to SEQ ID NO:49). It is preferred that the alcohol dehydrogenase 
from Equus caballus or a derivative or homologue thereof is used as the scaffold.

It is further preferred that the engineered enzymes have lyase activity. A 
particularly suited protein scaffold for this variant is an oxoacid lyase or is a
derivative thereof. Particularly preferred protein scaffolds for this variant are
aldolases or synthases, or are derived therefrom. The tertiary structure of the
protein scaffold can be of any type, but a (beta/alpha)8 barrel structure is 
preferred. 

In a first embodiment the protein scaffold has a tertiary structure similar to the
structure of the N-acetyl-D-neuraminic acid aldolase from Escherichia coli or has at
least 70% identity on the amino acid level to a protein that has a tertiary
structure similar to the structure of the N-acetyl-D-neuraminic acid aldolase from
Escherichia coli. Preferably, the scaffold has aldolase activity. It is preferred that
SDRs are inserted into the protein scaffold at one or more positions from the
group of positions that correspond structurally or by amino acid sequence
homology to the regions 45-55, 78-87, 105-113, 137-146, 164-171, 187-193,
205-210, 244-255 and 269-276 in N-acetyl-D-neuraminic acid aldolase from
Escherichia coli, and more preferably at one or more positions from the group of
positions that correspond structurally or by amino acid sequence homology to the
regions 45-52, 138-144, 189-192, 247-253 and 271-275 (numbering of amino
acids according to SEQ ID NO:50). It is preferred that the N-acetyl-D-neuraminic
acid aldolase from Escherichia coli or a derivative or homologue thereof is used
as the scaffold.

In a further embodiment the protein scaffold has a tertiary structure similar to 
the structure of the tryptophan synthase from Salmonella typhimurium or has at 
least 70% identity on the amino acid level to a protein that has a tertiary 
structure similar to the structure of the tryptophan synthase from Salmonella 
typhimurium. Preferably, the scaffold has synthase activity. It is preferred that 
SDRs are inserted into the protein scaffold at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence
homology to the regions 56-63, 127-134, 154-161, 175-193, 209-216 and 230- 
240 in tryptophan synthase from Salmonella typhimurium, and more preferably 
at one or more positions from the group of positions that correspond structurally 
or by amino acid sequence homology to the regions 57-62, 155-160, 178-190 
and 210-215 (numbering of amino acids according to SEQ ID NO: 51). It is 
preferred that the tryptophan synthase from Salmonella typhimurium or a 
derivative or homologue thereof is used as the scaffold. 

It is further preferred that the engineered enzymes have isomerase activity. A 
particularly suited protein scaffold for this variant is an aldose-converting or
ketose-converting isomerase, or is a derivative thereof.

In a first embodiment, the protein scaffold has a tertiary structure similar to the 
structure of the xylose isomerase from Actinoplanes missouriensis or has at least
70% identity on the amino acid level to a protein that has a tertiary structure
similar to the structure of the xylose isomerase from Actinoplanes missouriensis. 
It is preferred that SDRs are inserted into the protein scaffold at one or more 
positions from the group of positions that correspond structurally or by amino 
acid sequence homology to the regions 18-31, 92-103, 136-147, 178-188 and 
250-257 in xylose isomerase from Actinoplanes missouriensis, and more
preferably at one or more positions from the group of positions that correspond 
structurally or by amino acid sequence homology to the regions 20-27, 92-99 
and 180-186 (numbering of amino acids according to SEQ ID NO: 52). It is 
preferred that the xylose isomerase from Actinoplanes missouriensis or a 
derivative or homologue thereof is used as the scaffold.

It is further preferred that the engineered enzymes have ligase activity. A 
particularly suited protein scaffold for this variant is a DNA ligase, or is a
derivative thereof. 

In a first embodiment, the protein scaffold has a tertiary structure similar to the 
structure of the DNA ligase from Bacteriophage T7 or has at least 70% identity 
on the amino acid level to a protein that has a tertiary structure similar to the 
structure of the DNA ligase from Bacteriophage T7. It is preferred that SDRs are
inserted into the protein scaffold at one or more positions from the group of 
positions that correspond structurally or by amino acid sequence homology to the 
regions 52-60, 94-108, 119-131, 241-248, 255-263 and 302-318 in DNA ligase 
from Bacteriophage T7, and more preferably at one or more positions from the 
group of positions that correspond structurally or by amino acid sequence 
homology to the regions 96-106, 121-129, 256-262 and 304-316 (numbering of 
amino acids according to SEQ ID NO:53). It is preferred that the DNA ligase from 
Bacteriophage T7 or a derivative or homologue thereof is used as the scaffold.

A third aspect of the invention is directed to a method for generating engineered
enzymes with specificities that are qualitatively and/or quantitatively novel in
combination with the protein scaffold. The inventive method comprises at least 
the following steps: 

(a) providing a protein scaffold capable of catalyzing at least one chemical
reaction on at least one target substrate,

(b) generating a library of engineered enzymes or isolated engineered
enzymes by combining the protein scaffold from step (a) with one or more 
fully or partially random peptide sequences at sites in the protein scaffold that 
enable the resulting engineered enzyme to discriminate between at least one 
target substrate and one or more different substrates and 

(c) selecting out of the library of engineered enzymes generated in step (b) 
one or more enzymes that have defined specificities towards at least one 
target substrate. 

In a first variant of this aspect of the invention, the inventive method comprises 
at least the following steps: 

(a) providing a protein scaffold capable of catalyzing at least one chemical
reaction on at least one target substrate, 

(b) generating a library of engineered enzymes or isolated engineered 
enzymes by inserting into the protein scaffold from step (a) one or more fully 
or partially random peptide sequences at sites in the protein scaffold that 
enable the resulting engineered enzyme to discriminate between at least one 
target substrate and one or more different substrates and 

(c) selecting out of the library of engineered enzymes generated in step (b) 
one or more enzymes that have defined specificities towards at least one 
target substrate. 

Preferably, the positions at which the one or more fully or partially random 
peptide sequences are combined with or inserted into the protein scaffold are 
identified prior to the combination or insertion. 
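
By way of illustration only, step (b) of the first variant can be sketched on the amino acid level as follows. The scaffold sequence, the insertion indices, the peptide lengths and the function names are hypothetical and are not taken from the application; the sketch merely assumes that insertion sites have already been identified and that each inserted peptide is fully random over the 20 standard amino acids.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def random_peptide(length, rng):
    """Return a fully random peptide of the given length."""
    return "".join(rng.choice(AMINO_ACIDS) for _ in range(length))


def insert_sdrs(scaffold, sites, rng):
    """Insert one random peptide per predefined site.

    `sites` maps a 0-based index in the scaffold to the length of the
    random peptide that is inserted before that index (hypothetical format).
    """
    variant = scaffold
    # Work from the C-terminal end so that earlier indices stay valid.
    for index in sorted(sites, reverse=True):
        variant = variant[:index] + random_peptide(sites[index], rng) + variant[index:]
    return variant


def make_library(scaffold, sites, size, seed=0):
    """Generate `size` variants; a fixed seed keeps the sketch reproducible."""
    rng = random.Random(seed)
    return [insert_sdrs(scaffold, sites, rng) for _ in range(size)]


# Hypothetical 16-residue scaffold with two insertion sites.
library = make_library("MKTAYIAKQRQISFVK", {5: 3, 12: 4}, size=5)
for variant in library:
    print(variant)
```

Every variant keeps the scaffold residues unchanged and differs only in the two inserted stretches, which is the property that the subsequent selection step (c) relies on.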

The number of insertions or other combinations of fully or partially random 
peptide sequences as well as their length may vary over a wide range. The 
number is at least one, preferably more than one, more preferably between two 
and eleven, most preferably between two and six. The length of such fully or 
partially random peptide sequences is usually less than 50 amino acid residues. 
Preferably, the length is between one and 15 amino acid residues, more 
preferably between one and six amino acid residues. Alternatively, the length is
between two and 20 amino acid residues, preferably between two and ten amino 
acid residues, more preferably between three and eight amino acid residues.
Preferably such insertions or other combinations are performed on the DNA level,
using polynucleotides encoding such protein scaffolds and polynucleotides or 
oligonucleotides encoding such fully or partially random peptide sequences. 
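
The effect of peptide length and number of insertions on the theoretical library size can be illustrated by a short calculation. This is a sketch under the assumption that every position of a fully random peptide can independently take any of the 20 standard amino acids; the function names are hypothetical.

```python
def peptide_diversity(length, alphabet_size=20):
    """Number of distinct fully random peptides of a given length."""
    return alphabet_size ** length


def library_diversity(lengths):
    """Number of distinct variants when several random peptides are combined."""
    total = 1
    for length in lengths:
        total *= peptide_diversity(length)
    return total


for n in (1, 3, 6, 8):
    print(n, peptide_diversity(n))

# Two insertions of three residues each span the same sequence space
# as a single insertion of six residues.
print(library_diversity([3, 3]))
```

Under this assumption a single six-residue insertion already corresponds to 64 million possible sequences, which indicates why short peptides and a small number of insertions are preferred.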

Optionally, steps (a) to (c) are repeated cyclically, whereby enzymes selected in 
step (c) serve as the protein scaffold in step (a) of a further cycle, and 
randomized peptide sequences are either inserted or, alternatively, substituted 
for peptide sequences that have been inserted in former cycles. Thereby, the 
number of inserted peptide sequences is either constant or increases over the 
cycles. The cycles are repeated until one or more enzymes with the intended 
specificities are generated. 

Moreover, during or after one or more rounds of steps (a) to (c), the scaffold 
may be mutated at one or more positions in order to make the scaffold more 
acceptable for the combination with SDR sequences, and/or to increase catalytic 
activity at a specific pH and temperature, and/or to change the glycosylation 
pattern, and/or to decrease sensitivity towards enzyme inhibitors, and/or to 
change enzyme stability. 

In a second variant of this aspect of the invention, the inventive method 
comprises at least the following steps: 

(a) providing a first protein scaffold fragment, 

(b) connecting said protein scaffold fragment via a peptide linkage with a first 
SDR, and optionally 

(c) connecting the product of step (b) via a peptide linkage with a further SDR 
peptide or with a further protein scaffold fragment, and optionally 

(d) repeating step (c) for as many cycles as necessary in order to generate a 
sufficiently specific enzyme, and 

(e) selecting out of the population generated in steps (a) - (d) one or more 
enzymes that have the desired specificities toward the one or more target 
substrates. 

Protein scaffold fragment means a part of the sequence of a protein scaffold. A 
protein scaffold is comprised of at least two protein scaffold fragments.

In a third variant of this aspect of the invention, the protein scaffold, the SDRs
and the engineered enzyme are encoded by a DNA sequence and an expression 
system is used in order to produce the protein. In an alternative variant, the 
protein scaffold, the SDRs and/or the engineered enzyme are chemically 
synthesized from peptide building blocks. 

In a fourth variant of this aspect of the invention, the inventive method 
comprises at least the following steps: 

(a) providing a polynucleotide encoding a protein scaffold capable of catalyzing 
one or more chemical reactions on one or more target substrates; 

(b) combining one or more fully or partially random oligonucleotide sequences
with the polynucleotide encoding the protein scaffold, the fully or partially 
random oligonucleotide sequences being located at sites in the polynucleotide 
that enable the encoded engineered enzyme to discriminate between the one or 
more target substrates and one or more other substrates; and 

(c) selecting out of the population generated in step (b) one or more 
polynucleotides that encode enzymes that have the defined specificities toward 
the one or more target substrates. 

Any enzyme can serve as the protein scaffold in step (a). It can be a naturally 
occurring enzyme, a variant or a truncated derivative thereof, or an engineered
enzyme. For human therapeutic use, the protein scaffold is preferably a 
mammalian enzyme, and more preferably a human enzyme. In that aspect, the 
invention is directed to a method for the generation of essentially mammalian, 
especially of essentially human enzymes with specificities that are different from 
specificities of any enzyme encoded in mammalian genomes or in the human 
genome, respectively. 

According to the invention, the protein scaffold provided in step (a) of this aspect
must be capable of catalyzing one or more chemical reactions on a target
substrate. Therefore, a protein scaffold is selected from the group of potential 
protein scaffolds by its activity on the target substrate. 

In a preferred variant of this aspect of the invention, a protein scaffold with 
hydrolase activity is used. Preferably, a protein scaffold with proteolytic activity
is used, and more preferably, a protease with very low specificity having basic
activity on the target substrate is used as the protein scaffold. Examples of
proteases from different structural classes with low substrate specificity are
Papain, Trypsin, Chymotrypsin, Subtilisin, SET (trypsin-like serine protease from
Streptomyces erythraeus), Elastase, Cathepsin G or Chymase. Before being
employed as the protein scaffold, the amino acid sequence of the protease may
be modified in order to change protein properties other than specificity, e.g.
catalytic activity, stability, inhibitor sensitivity, or expression yield, essentially as
described in WO 92/18645, or in order to change specificity, essentially as
described in EP 02020576.3 and PCT/EP03/04864.

Lipases are another feasible option for the protein scaffold. Hepatic lipase,
lipoprotein lipase and pancreatic lipase belong to the "lipoprotein lipase
superfamily", which in turn is an example of the GX-class of lipases (M. Fischer,
J. Pleiss (2003), Nucl. Acids Res., 31, 319-321). The substrate specificity of
lipases can be characterized by their relative activity towards triglycerol esters of 
fatty acids and phospholipids, bearing a charged head group. Alternatively, other 
hydrolases such as esterases, glycosylases, amidases, or nitrilases may be used 
as scaffolds. 

Transferases are also feasible protein scaffolds. Glycosyltransferases are involved
in many biological syntheses involving a variety of donors and acceptors.
Alternatively, the protein scaffold may have ligase, lyase, oxidoreductase, or
isomerase activity.

In a first embodiment, the one or more fully or partially random peptide
sequences are inserted at specific sites in the protein scaffold. These insertion 
sites are characterized by the fact that the inserted peptide sequences can act as 
discriminators between different substrates, i.e. as Specificity Determining 
Regions or SDRs. Such insertion sites can be identified by several approaches. 
Preferably, insertion sites are identified by analysis of the three-dimensional 
structure of the protein scaffolds, by comparative analysis of the primary 
sequences of the protein scaffold with other enzymes having different 
quantitative specificities, or experimentally by techniques such as alanine 
scanning, random mutagenesis, or random deletion, or by any combination 
thereof.

A first approach to identify insertion sites for SDRs is based on the
three-dimensional structure of the protein scaffold as obtained by x-ray
crystallography or by nuclear magnetic resonance studies. Structural alignment
of the protein scaffold in comparison with other enzymes of the same structural
class but having different quantitative specificities reveals regions of high
structural similarity and regions with low structural similarity. Such an analysis
can for example be done using public software such as Swiss PDB viewer (Guex,
N. and Peitsch, M.C. (1997) Electrophoresis 18, 2714-2723). Regions of low
structural similarity are preferred SDR insertion sites.
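
A minimal sketch of how regions of low structural similarity might be flagged is given below. It assumes that the two structures have already been superimposed (for example with the software mentioned above) and that equivalent C-alpha atoms are supplied in matching order; the 2.0 Å threshold, the minimum region length and the coordinates are hypothetical values chosen for illustration.

```python
import math


def per_residue_deviation(coords_a, coords_b):
    """Distance (in Å) between equivalent C-alpha atoms of two superimposed structures."""
    return [math.dist(a, b) for a, b in zip(coords_a, coords_b)]


def low_similarity_regions(deviations, threshold=2.0, min_length=2):
    """Return inclusive (start, end) index pairs of consecutive residues
    whose deviation exceeds the threshold."""
    regions, start = [], None
    for i, deviation in enumerate(deviations):
        if deviation > threshold:
            if start is None:
                start = i
        elif start is not None:
            if i - start >= min_length:
                regions.append((start, i - 1))
            start = None
    if start is not None and len(deviations) - start >= min_length:
        regions.append((start, len(deviations) - 1))
    return regions


# Hypothetical coordinates: residues 2 to 4 form a loop that deviates.
scaffold_ca = [(float(i), 0.0, 0.0) for i in range(6)]
other_ca = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 3.0, 0.0),
            (3.0, 4.0, 0.0), (4.0, 3.0, 0.0), (5.0, 0.0, 0.0)]
deviations = per_residue_deviation(scaffold_ca, other_ca)
regions = low_similarity_regions(deviations)
print(regions)
```

The indices returned are 0-based positions in the supplied coordinate lists and would have to be mapped back to the residue numbering of the scaffold.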

In a second approach to identify insertion sites for SDRs, three-dimensional 
structures of the scaffold protein in complex with competitive inhibitors or 
substrate analogs are analysed. It is assumed that the binding site of a 
competitive inhibitor significantly overlaps with the binding site of the substrate. 
In that case, atoms of the protein that are within a certain distance of atoms of 
the inhibitor are likely to be at a similar distance from the substrate as well.
Choosing a short distance, e.g. < 5 Å, will result in an ensemble of protein atoms
that are in close contact with the substrate. These residues would constitute the 
first shell contacts and are therefore preferred insertion sites for SDRs. Once first 
shell contacts have been identified, second shell contacts can be found by 
repeating the distance analysis starting from first shell atoms. In yet another 
alternative of the invention the distance analysis described above is performed
starting from the active site residues. 
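
The distance analysis described above can be sketched as follows. The atom representation (a residue number paired with a coordinate triple), the function names and the example coordinates are assumptions made for illustration; only the 5 Å cutoff is taken from the text.

```python
import math


def near_any(point, references, cutoff):
    """True if `point` lies closer than `cutoff` Å to any reference coordinate."""
    return any(math.dist(point, ref) < cutoff for ref in references)


def contact_shells(atoms, inhibitor_coords, cutoff=5.0):
    """Return (first_shell, second_shell) as sorted residue numbers.

    `atoms` is a list of (residue_number, (x, y, z)) tuples for the protein.
    The first shell is found from the inhibitor atoms; the second shell is
    found by repeating the analysis starting from the first-shell atoms.
    """
    first = [(res, xyz) for res, xyz in atoms if near_any(xyz, inhibitor_coords, cutoff)]
    first_residues = {res for res, _ in first}
    first_coords = [xyz for _, xyz in first]
    second_residues = {
        res for res, xyz in atoms
        if res not in first_residues and near_any(xyz, first_coords, cutoff)
    }
    return sorted(first_residues), sorted(second_residues)


# Hypothetical protein atoms and a single inhibitor atom.
protein_atoms = [
    (57, (0.0, 0.0, 0.0)),
    (195, (4.0, 0.0, 0.0)),
    (189, (8.0, 0.0, 0.0)),
    (220, (20.0, 0.0, 0.0)),
]
inhibitor = [(1.0, 0.0, 0.0)]
first_shell, second_shell = contact_shells(protein_atoms, inhibitor)
print(first_shell, second_shell)
```

Passing the coordinates of the active site residues instead of the inhibitor atoms gives the alternative analysis mentioned in the last sentence of the paragraph above.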

In a third approach to identify insertion sites for SDRs, the primary sequence of
the scaffold protein is aligned with other enzymes of the same structural class 
but having different quantitative specificities using an alignment algorithm. 
Examples of such alignment algorithms are published (Altschul, S.F., Gish, W., 
Miller, W., Myers, E.W. & Lipman, D.J. (1990) J. Mol. Biol. 215:403-410; 
"Statistical methods in Bioinformatics: an introduction" by Ewens, W. & Grant, 
G.R. 2001, Springer, New York). Such an alignment may reveal conserved and 
non-conserved regions with varying sequence homology, and, in particular, 
additional sequence elements in one or more enzymes compared to the scaffold 
protein. Conserved regions are more likely to contribute to phenotypes shared
among the different proteins, e.g. stabilizing the three-dimensional fold.
Non-conserved regions and, in particular, additional sequences in enzymes with
quantitatively higher specificity (Turner, R. et al. (2002) J. Biol. Chem., 277,
33068-33074) are preferred insertion sites for SDRs. 
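
As an illustration of how additional sequence elements may be read from an alignment, the following sketch scans two rows of a pairwise alignment for stretches that are present in the more specific enzyme but absent from the scaffold. The alignment itself is assumed to have been computed beforehand with an algorithm such as those cited above; the toy sequences, the gap symbol and the function name are hypothetical.

```python
def extra_stretches(aligned_scaffold, aligned_specific, min_length=4):
    """Find stretches where the specific enzyme has residues but the scaffold has gaps.

    Both arguments are rows of one pairwise alignment ('-' marks a gap).
    Returns (scaffold_position, inserted_sequence) tuples, where
    scaffold_position is the number of scaffold residues preceding the stretch.
    """
    if len(aligned_scaffold) != len(aligned_specific):
        raise ValueError("aligned sequences must have equal length")
    stretches, current, position = [], "", 0
    for s, t in zip(aligned_scaffold, aligned_specific):
        if s == "-" and t != "-":
            current += t  # extend the current additional stretch
            continue
        if len(current) >= min_length:
            stretches.append((position, current))
        current = ""
        if s != "-":
            position += 1
    if len(current) >= min_length:
        stretches.append((position, current))
    return stretches


# Hypothetical alignment: one four-residue and one single-residue addition.
scaffold_row = "ACDE----FGHIK-LM"
specific_row = "ACDEWWYYFGH-KPLM"
found = extra_stretches(scaffold_row, specific_row)
print(found)
```

The default `min_length=4` retains only stretches longer than three amino acids; shorter additions, such as the single residue in the example, are ignored.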

For proteases currently five families are known, namely aspartic-, cysteine-, 
serine-, metallo- and threonine proteases. Each family includes groups of 
proteases that share a similar fold. Crystallographic structures of members of
these groups have been solved and are accessible through public databases, e.g. 
the Brookhaven protein database (H.M. Berman et al. Nucleic Acids Research, 28 
pp. 235-242 (2000)). Such databases also include structural homologs in other 
enzyme classes and nonenzymatically active proteins of each class. Several tools 
are available to search public databases for structural homologues: SCOP - a
structural classification of proteins database for the investigation of sequences
and structures (Murzin A. G. et al. (1995) J. Mol. Biol. 247, 536-540); CATH -
Class, Architecture, Topology and Homologous superfamily: a hierarchical
classification of protein domain structures (Orengo et al. (1997) Structure 5(8)
1093-1108); FSSP - Fold classification based on structure-structure alignment of
proteins (Holm and Sander (1998) Nucl. Acids Res. 26 316-319); or VAST -
Vector alignment search tool (Gibrat, Madej and Bryant (1996) Current Opinion
in Structural Biology 6, 377-385).

In the above described approaches, members of structural classes are compared 
in order to identify insertion sites for SDRs. 

In a preferred variant of these approaches serine proteases of the structural 
class S1 are compared with each other. Trypsin represents a member with low 
substrate specificity, as it requires only an arginine or lysine residue at the P1
position. On the other hand, thrombin, tissue-type plasminogen activator or
enterokinase all have a high specificity towards their substrate sequences, i.e.
(L/I/V/F)XPR^NA, CPGR^VVGG and DDDK^, respectively, where ^ marks the
cleavage site (Perona, J. & Craik, C. (1997) J. Biol. Chem., 272, 29987-29990;
Perona, J. & Craik, C. (1995) Protein Science, 4, 337-360). An alignment of the
amino acid sequences of these proteases is described in example 1 (Figure 2)
along with the identification of SDRs.

A further example within the family of serine proteases is given by members of
the structural class S8 (subtilisin fold). Subtilisin is the type protease for this 
class and represents an unspecific protease (Ottesen, M. & Svendsen, A. (1998)
Methods Enzymol. 19, 199-215). Furin, PC1 and PC5 are proteases of the same
structural class involved in the processing of propeptides and have a high
substrate specificity (Seidah, N. & Chretien, M. (1997) Curr. Opin. Biotech., 8:
602-607; Bergeron, F. et al. (2000) J. Mol. Endocrinol., 24:1-22). In a preferred
variant of the approach alignments of the primary amino acid sequences (Figure
4) are used to identify eleven sequence stretches longer than three amino acids 
which specific proteases have in addition compared to subtilisin and are therefore 
potential specificity determining regions. In a further variant of the approach 
information from the three-dimensional structure of subtilisin can be used in 
order to further narrow down the selection (Figure 3). Out of the eleven inserted 
sequence stretches, three are especially close to the active site residues, namely 
stretches number 7, 8 and 11, which are insertions in PC5, PC1 and all three
specific proteases, respectively (Figure 3). In a preferred variant, one or several
amino acid stretches of variable length and composition can be inserted into the
subtilisin sequence at one or several of the eleven positions. In a more preferred
variant of the approach the insertion is performed at regions 7, 8 or 11 or any
combination thereof. In another preferred variant of the approach protease 
scaffolds other than subtilisin from the structural class S8 are used. 

In a further preferred variant of this approach, aspartic acid proteases of the 
structural class A1 are analyzed (Rawlings, N.D. & Barrett, A.J. (1995). Methods
Enzymol. 248, 105-120; Chitpinityol, S. & Crabbe, M.J. (1998), Food Chemistry,
61, 395-418). Examples for the A1 structural class of aspartic proteases are
pepsin with a low substrate specificity, as well as beta-secretase
(Gruninger-Leitch, F., et al. (2002) J. Biol. Chem. 277, 4687-4693) and renin
(Wang, W. & Liang, T.C. (1994)
Biochemistry, 33, 14636-14641) with relatively high substrate specificities. 
Retroviral proteases also belong to this class, although the active enzyme is a 
dimer of two identical subunits. The viral proteases are essential for the correct 
processing of the polyprotein precursor to generate functional proteins which 
requires a high substrate specificity in each case (Wu, J. et al. (1998) 
Biochemistry, 37, 4518-4526; Pettit, S. et al. (1991) J. Biol. Chem., 266,
14539-14547). Pepsin is the type protease for this class and represents an unspecific
protease (Kageyama, T. (2002) Cell. Mol. Life Sci. 59, 288-306). Beta-secretase and
Cathepsin D (Aguilar, C. F. et al. (1995) Adv. Exp. Med. Biol. 362, 155-166) are
proteases of the same structural class and have a high substrate specificity. In a
preferred variant of the approach alignments of the primary amino acid
sequences (Figure 6) are used to identify six sequence stretches longer than
three amino acids which are inserted in the specific proteases compared to
pepsin and are therefore potential specificity determining regions. In a further
variant of the approach information from the three-dimensional structure of
beta-secretase can be used in order to further narrow down the selection. Out of the
six inserted sequence stretches, three are especially close to the active site
residues, namely stretches number 1, 3 and 4, which are insertions in cathepsin D
and beta-secretase, respectively (Figure 5). In a preferred variant of the
approach, one or several amino acid stretches of variable length and composition 
can be inserted into the pepsin sequence at one or several of the six positions. In 
a more preferred embodiment of the invention the insertion is performed at the 
positions 1, 3 or 4 or any combination thereof. In another preferred embodiment 
of the invention protease scaffolds other than pepsin are used. 

There are cases where a certain structural class does not include known 
members of low and high specificity. This is exemplified by the C14 class of
caspases which belong to the cysteine protease family (Rawlings, N.D. & Barrett,
A.J. (1994) Methods Enzymol. 244, 461-486) and which all show high specificity
for P4 to P1 positions. For example, caspase-1, caspase-3 and caspase-9
recognize the sequences YVAD^, DEVD^ or LEHD^, respectively. Identification
of the regions that differ between the caspases will include the regions
responsible for the differences in substrate specificity (Figures 7 and 8).
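
The difference between low and high substrate specificity can be made concrete by counting potential cleavage sites in a substrate. The sketch below uses only recognition sequences quoted in the text (arginine or lysine for trypsin, DDDK for enterokinase, and the caspase tetrapeptides) and deliberately ignores any further sequence preferences; the substrate sequence and all names are hypothetical.

```python
import re

# Recognition patterns as quoted in the text; cleavage is taken to occur
# directly after the last residue matched (the P1 residue).
CLEAVAGE_PATTERNS = {
    "trypsin": r"[KR]",
    "enterokinase": r"DDDK",
    "caspase-1": r"YVAD",
    "caspase-3": r"DEVD",
    "caspase-9": r"LEHD",
}


def cleavage_sites(sequence, protease):
    """Return the 1-based positions of the P1 residue for every match."""
    # A lookahead is used so that overlapping matches are also reported.
    pattern = re.compile(f"(?=({CLEAVAGE_PATTERNS[protease]}))")
    return [m.start(1) + len(m.group(1)) for m in pattern.finditer(sequence)]


substrate = "MKRDDDKGSDEVDGKLEHDAR"  # hypothetical substrate
for name in CLEAVAGE_PATTERNS:
    print(name, cleavage_sites(substrate, name))
```

In this toy substrate the unspecific pattern matches at five positions, whereas each of the specific recognition sequences matches at most once.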

Finally, non-enzymatic proteins of the same fold as the enzyme scaffold may also
contribute to the identification of insertion sites for SDRs. For example,
haptoglobin (Arcoleo, J. & Greer, J.; (1982) J. Biol. Chem. 257, 10063-10068)
and azurocidin (Almeida, R. et al. (1991) Biochem. Biophys. Res. Commun. 177,
688-695) share the same chymotrypsin-like fold with all S1 proteases. Due to
substitutions in the active site residues these proteins do not possess any
proteolytic function, yet they show high homology with active proteases.
Differences between these proteins and specific proteases include regions that
can serve as insertion sites for SDRs. 

In a fourth approach, insertion sites for SDRs are identified experimentally by 
techniques such as alanine scanning, random mutagenesis, random insertion or 
random deletion. In contrast to the approach disclosed above, this approach 
does not require detailed knowledge about the three-dimensional structure of the 
scaffold protein. In one preferred variant of this approach, random mutagenesis 
of enzymes with relatively high specificity from the same structural class as the 
protein scaffold and screening for loss or change of specificity can be used to 
identify insertion sites for SDRs in the protein scaffold. 

Random mutagenesis, alanine scanning, random insertion or random deletion are 
all done on the level of the polynucleotides encoding the enzymes. There are a 
variety of protocols known in the literature (e.g. Sambrook, J.F.; Fritsch, E.F.;
Maniatis, T.; Cold Spring Harbor Laboratory Press, Second Edition, 1989, New
York). For example, random mutagenesis can be achieved by the use of a
polymerase as described in patent WO 9218645. According to this patent, the
one or more genes encoding the one or more proteases are amplified by use of a
DNA polymerase with a high error rate or under conditions that increase the rate
of misincorporations. For example the method of Cadwell and Joyce can be
employed (Cadwell, R.C. and Joyce, G.F., PCR Methods Appl. 2 (1992) 28-33).
Other methods of random mutagenesis such as, but not limited to, the use of
mutator strains, chemical mutagens or UV-radiation can be employed as well.

Alternatively, oligonucleotides can be used for mutagenesis that substitute 
randomly distributed amino acid residues with an alanine. This method is
generally referred to as alanine scanning mutagenesis (Fersht, A.R. Biochemistry
(1989) 8031-8036). As a further alternative, modifications of the alanine
scanning mutagenesis such as binomial mutagenesis (Gregoret, L.M. and Sauer,
R.T. PNAS (1993) 4246-4250) or combinatorial alanine scanning (Weiss et al., 
PNAS (2000) 8950-8954) can be employed. 
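
On the sequence level, the set of variants examined in an alanine scan can be enumerated as sketched below. This is an illustration only: it operates on the amino acid sequence rather than on mutagenic oligonucleotides, and the function name and example sequence are hypothetical.

```python
def alanine_scan(sequence, positions=None):
    """Return (label, variant) pairs for single alanine substitutions.

    `positions` are 1-based; by default every position is scanned.
    Residues that are already alanine are skipped.
    """
    if positions is None:
        positions = range(1, len(sequence) + 1)
    variants = []
    for pos in positions:
        wild_type = sequence[pos - 1]
        if wild_type == "A":
            continue
        variant = sequence[:pos - 1] + "A" + sequence[pos:]
        variants.append((f"{wild_type}{pos}A", variant))
    return variants


scan = alanine_scan("MKAD")  # hypothetical four-residue example
for label, variant in scan:
    print(label, variant)
```

A randomly chosen subset of positions can be passed via `positions` to mimic substitution of randomly distributed residues.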

In order to express engineered enzymes, the DNA encoding such engineered
proteins is ligated into a suitable expression vector by standard molecular cloning
techniques (e.g. Sambrook, J.F.; Fritsch, E.F.; Maniatis, T.; Cold Spring Harbor
Laboratory Press, Second Edition, 1989, New York). The vector is introduced into a
suitable expression host cell, which expresses the corresponding engineered
enzyme variant. Particularly suitable expression hosts are bacterial expression
hosts such as Escherichia coli or Bacillus subtilis, or yeast expression hosts such
as Saccharomyces cerevisiae or Pichia pastoris, or mammalian expression hosts
such as Chinese Hamster Ovary (CHO) or Baby Hamster Kidney (BHK) cell lines,
or viral expression systems such as bacteriophages like M13 or Lambda, or
viruses such as the Baculovirus expression system. As a further alternative,
systems for in vitro protein expression can be used. Typically, the DNA is ligated
into an expression vector behind a suitable signal sequence that leads to
secretion of the enzyme variants into the extracellular space, thereby allowing
direct detection of protease activity in the cell supernatant. Particularly suitable
signal sequences for Escherichia coli are HlyA, for Bacillus subtilis AprE, NprB,
Mpr, AmyA, AmyE, Blac, SacB, and for S. cerevisiae Bar1, Suc2, Mata, InulA,
Ggp1p. Alternatively, the enzyme variants are expressed intracellularly and the
substrates are also expressed intracellularly. Preferably, this is done essentially
as described in patent application WO 0212543, using a fusion peptide substrate
comprising two auto-fluorescent proteins linked by the substrate amino-acid
sequence. As a further alternative, after intracellular expression of the enzyme
variants, or secretion into the periplasmic space using signal sequences such
as DsbA, PhoA, PelB, OmpA, OmpT or gIII for Escherichia coli, a permeabilisation
or lysis step releases the enzyme variants into the supernatant. The destruction
of the membrane barrier can be forced by the use of mechanical means such as
ultrasonication or a French press, or by the use of membrane-digesting enzymes
such as lysozyme. As another alternative, the genes encoding the enzyme
variants are expressed cell-free by the use of a suitable cell-free expression
system. For example, the S30 extract from Escherichia coli cells is used for this
purpose as described by Lesley et al. (Methods in Molecular Biology 37 (1995)
265-278).

The ensemble of gene variants generated and expressed by any of the above
methods is analyzed with respect to affinity, substrate specificity or
activity by appropriate assay and screening methods as described in detail for 
example in patent application PCT/EP03/04864. Genes from catalytically active 
variants having reduced specificity in comparison to the original enzyme are 
analyzed by sequencing. Sites at which mutations and/or insertions and/or
deletions occurred are preferred insertion sites at which SDRs can be inserted
site-specifically.
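
The evaluation of the sequencing results may be sketched as follows. The example is restricted to substitutions in variants of unchanged length; variants carrying insertions or deletions would first have to be aligned to the parent. The sequences, the `min_hits` threshold and the function names are hypothetical.

```python
from collections import Counter


def substitution_sites(parent, variant):
    """1-based positions at which an equal-length variant differs from the parent."""
    if len(parent) != len(variant):
        raise ValueError("substitutions only; align variants with insertions or deletions first")
    return [i for i, (p, v) in enumerate(zip(parent, variant), start=1) if p != v]


def candidate_sites(parent, variants, min_hits=2):
    """Positions mutated in at least `min_hits` of the sequenced variants."""
    counts = Counter(pos for v in variants for pos in substitution_sites(parent, v))
    return sorted(pos for pos, hits in counts.items() if hits >= min_hits)


# Hypothetical parent enzyme and three active variants with reduced specificity.
parent = "MKTAYIAK"
sequenced = ["MKTGYIAK", "MKTSYIAR", "MRTAYIAK"]
sites = candidate_sites(parent, sequenced)
print(sites)
```

Requiring a position to recur in several independent variants is one possible way to separate sites relevant to specificity from incidental mutations; the application itself does not prescribe such a threshold.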

In a second embodiment, the one or more fully or partially random peptide 
sequences are inserted at random sites in the protein scaffold. This modification
is usually done on the polynucleotide level, i.e. by inserting nucleotide sequences 
into the gene that encodes the protein scaffold. Several methods are available 
that enable the random insertion of nucleotide sequences. Systems that can be 
used for random insertion are for example ligation based systems (Murakami et 
al. Nature Biotechnology 20 (2002) 76-81), systems based on DNA 
polymerisation and transposon based systems (e.g. GPS-M™ mutagenesis 
system, NEB Biolabs; MGS™ mutation generation system, Finnzymes). The 
transposon-based methods employ a transposase- mediated insertion of a 
selectable marker gene that contains at its termini recognition sequences for the 
transposase as well as two sites for a rare cutting restriction endonuclease. Using 
the latter endonuclease one usually releases the selection marker and after 
religation obtains an insertion. Instead of performing the religation one can 
alternatively insert a fragment that has terminal recognition sequences for one or 
two outside cutting restriction endonuclease as well as a selectable marker. After 
ligation, one releases this fragment using the one or two outside cutting 
endonucleases. After creating blunt ends by standard methods one inserts blunt 
ended random fragments at random positions into the gene. 
In a further preferred embodiment, methods for homologous in-vitro 
recombination are used to combine the mutations introduced by the above 
mentioned methods to generate enzyme populations. Examples of methods that 
can be applied are the Recombination Chain Reaction (RCR) according to patent 
application WO 0134835, the DNA- Shuffling method according to the patent 
application WO 9522625, the Staggered Extension method according to patent 
WO 9842728, or the Random Priming recombination according to patent 
application W09842728. Furthermore, also methods for non-homologous 
recombination such as the Itchy method can be applied (Ostermeier, M. et al. 
Nature Biotechnology 17 (1999) 1205-1209). 

Upon random insertion of a nucleotide sequence into the protein scaffold one
obtains a library of different genes encoding enzyme variants. The polynucleotide
library is subsequently transferred to an appropriate expression vector. Upon
expression in a suitable host or by use of an in vitro expression system, a library
of enzymes containing randomly inserted stretches of amino acids is obtained.

According to step (b) of this third aspect of the invention, one or more fully or
partially random peptide sequences are inserted into the protein scaffold. The
actual number of such inserted SDRs is determined by the intended quantitative
specificity following the relation: the higher the intended specificity, the more
SDRs are inserted. Whereas a single SDR enables the generation of moderately
specific enzymes, two SDRs already enable the generation of significantly specific
enzymes. However, up to six or even more SDRs can be inserted into a protein
scaffold. A similar relation is valid for the length of the SDRs: the higher the
intended specificity, the longer the SDRs that are to be inserted. SDRs can
be as short as one to four amino acid residues. They can, however, also be as
long as 50 amino acid residues. Significant specificity can already be generated
by the use of SDRs of a length of four to six amino acid residues.

The peptide sequences that are inserted can be fully or partially random. In this
context, fully random means that a set of sequences is inserted in parallel that
includes sequences that differ from each other in each and every position.
Partially random means that a set of sequences is inserted in parallel that
includes sequences that differ from each other in at least one position. This
difference can be either pair-wise or with respect to a single sequence. For
example, when regarding an insertion of the length of four amino acids, partially
random could be a set (i) that includes AGGG, GVGG, GGLG, GGGI, or (ii) that
includes AGGG, VGGG, LGGG and IGGG. Alternatively, random sequences also
comprise sequences that differ from each other in length. Randomization of the
peptide sequences is achieved by randomization of the nucleotide sequences that
are inserted into the gene at the respective sites. Thereby, randomization can be
achieved by employing mixtures of nucleobases as monomers during chemical
synthesis of the oligonucleotides. A particularly preferred mixture of monomers
for a fully random codon that in addition minimizes the probability of stop codons
is NN(GTC). Alternatively, random oligonucleotides can be obtained by
fragmentation of DNA into short fragments that are inserted into the gene at the
respective sites. The source of the DNA to be fragmented may be a synthetic
oligonucleotide but alternatively may originate from cloned genes, cDNAs, or
genomic DNA. Preferably, the DNA is a gene encoding an enzyme. The
fragmentation can, for example, be achieved by random endonucleolytic
digestion of DNA. Preferably, an unspecific endonuclease such as DNAse I (e.g.
from bovine pancreas) is employed for the endonucleolytic digestion.
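
The effect of the NN(GTC) monomer mixture mentioned above can be checked by
enumerating the codons it produces. The following is a minimal sketch, assuming
the standard genetic code; the function name codon_scheme_stats is hypothetical
and not part of the described method.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codon_scheme_stats(pos1, pos2, pos3):
    """Enumerate all codons allowed by a per-position monomer mixture.

    Returns (number of codons, list of stop codons, set of encoded residues).
    """
    codons = ["".join(c) for c in product(pos1, pos2, pos3)]
    stops = [c for c in codons if CODON_TABLE[c] == "*"]
    residues = {CODON_TABLE[c] for c in codons} - {"*"}
    return len(codons), stops, residues


# NN(GTC): any base at positions 1 and 2, only G, T or C at position 3.
n_codons, stops, residues = codon_scheme_stats("ACGT", "ACGT", "GTC")
print(n_codons, stops, len(residues))
```

Under these assumptions, excluding A from the third position removes the stop
codons TAA and TGA while every amino acid remains encoded; only TAG is left as
a possible stop codon.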

If steps (a) - (c) of the inventive method are repeated cyclically, there are
different alternatives for obtaining random peptide sequences that are inserted in
consecutive rounds. Preferably, SDRs that were identified in one round as leading
to increased specificity of the enzyme are used as templates for the random peptide
sequences that are inserted in the following round.

In a preferred alternative, the sequences selected in one round are analysed and
randomized oligonucleotides are generated based on these sequences. This can,
for example, be achieved by using, at each position in the oligonucleotide
synthesis, the original nucleotide together with a certain percentage of a mixture
of the other three nucleotide monomers. If, for example, in a first round an SDR
is identified that has the amino acid sequence ARLT, e.g. encoded by the
nucleotide sequence GCG CGC CTT ACC, a random peptide sequence inserted in
this SDR site could be encoded by an oligonucleotide with 70% G, 10% A, 10% T
and 10% C at the first position, 70% C, 10% G, 10% T and 10% A at the second
position, etc. This leads at each position approximately in 1 of 3 cases to the
template amino acid and in 2 of 3 cases to another amino acid.
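
The doping scheme described above can be made concrete with a short
calculation. The following is a minimal sketch, assuming the standard genetic
code and independent incorporation at each position; the function name
residue_distribution is hypothetical. With 70% of the template base at each of
the three positions, the exact template codon is retained with probability
0.7 x 0.7 x 0.7, i.e. about 0.34; synonymous codons can raise the share of the
template amino acid somewhat above that value.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def residue_distribution(template_codon, keep=0.7):
    """Probability of each encoded residue ('*' = stop) for a doped codon.

    At every position the template base is used with probability `keep`,
    and each of the other three bases with probability (1 - keep) / 3.
    """
    other = (1.0 - keep) / 3.0
    dist = {}
    for codon, aa in CODON_TABLE.items():
        p = 1.0
        for base, template_base in zip(codon, template_codon):
            p *= keep if base == template_base else other
        dist[aa] = dist.get(aa, 0.0) + p
    return dist


# Template SDR "ARLT" encoded by GCG CGC CTT ACC, as in the text.
for codon in ["GCG", "CGC", "CTT", "ACC"]:
    template_residue = CODON_TABLE[codon]
    share = residue_distribution(codon)[template_residue]
    print(codon, template_residue, round(share, 3))
```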

In another preferred alternative, the sequences selected in one round are
analyzed and a consensus library is generated based on these sequences. This
can, for example, be achieved by using defined mixtures of nucleotides at each
position in the oligonucleotide synthesis in a way that leads to mixtures of the
amino acid residues that were identified at each position of the SDR selected in
the previous round. If, for example, in a first round two SDRs are identified that
have the amino acid sequences ARLT and VPGS, a consensus library inserted in
this SDR site in the following round could be encoded by an oligonucleotide with
the sequence G(C/T)G C(G/C)C (G/T)(G/T)G (A/T)CC. This would correspond to
the random peptide sequence (A/V)(R/P)(L/G/V/W)(T/S), thereby allowing all
combinations of the amino acid residues identified in the first round, and, due to
the degeneracy of the genetic code, allowing in addition, to a lower degree,
alternative amino acid residues at some positions.

In another preferred alternative, the sequences selected in one round are,
without previous analysis, recombined using methods for the in vitro
recombination of polynucleotides, such as the methods described in WO
01/34835 (the following also provides details of the eighth and ninth aspects of
the invention).
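
The consensus-library example given above (ARLT and VPGS) can be verified by
expanding the degenerate oligonucleotide codon by codon. The following is a
minimal sketch, assuming the standard genetic code; the function name
expand_consensus and the list-of-strings representation of the oligonucleotide
are hypothetical choices made for illustration.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def expand_consensus(oligo):
    """Translate a degenerate oligonucleotide into per-position residue sets.

    `oligo` is a list with one string per nucleotide position; each string
    lists the bases allowed at that position.
    """
    if len(oligo) % 3 != 0:
        raise ValueError("oligonucleotide length must be a multiple of three")
    residue_sets = []
    for i in range(0, len(oligo), 3):
        codons = product(*oligo[i:i + 3])
        residue_sets.append({CODON_TABLE["".join(c)] for c in codons})
    return residue_sets


# G(C/T)G C(G/C)C (G/T)(G/T)G (A/T)CC from the text.
oligo = ["G", "CT", "G",
         "C", "GC", "C",
         "GT", "GT", "G",
         "AT", "C", "C"]
for position, residues in enumerate(expand_consensus(oligo), start=1):
    print(position, sorted(residues))
```

The third codon illustrates the additional residues mentioned in the text:
mixing G and T at two positions yields valine and tryptophan besides the
leucine and glycine found in the first round.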

After insertion of the partially or fully random sequences into the gene encoding
the scaffold protein, and, where applicable, ligation of the resulting gene into a
suitable expression vector using standard molecular cloning techniques (Sambrook,
J.F.; Fritsch, E.F.; Maniatis, T.; Cold Spring Harbor Laboratory Press, Second
Edition, 1989, New York), the vector is introduced into a suitable expression host
cell which expresses the corresponding enzyme variant. Particularly suitable
expression hosts are bacterial expression hosts such as Escherichia coli or
Bacillus subtilis, or yeast expression hosts such as Saccharomyces cerevisiae or
Pichia pastoris, or mammalian expression hosts such as Chinese Hamster Ovary (CHO)
or Baby Hamster Kidney (BHK) cell lines, or viral expression systems such as
bacteriophages like M13, T7 phage or Lambda, or viruses such as the Baculovirus
expression system. As a further alternative, systems for in vitro protein
expression can be used. Typically, the DNA is ligated into an expression vector
behind a suitable signal sequence that leads to secretion of the enzyme variants
into the extracellular space, thereby allowing direct detection of enzyme activity
in the cell supernatant. Particularly suitable signal sequences for Escherichia coli
are ompA, pelB, HlyA, for Bacillus subtilis AprE, NprB, Mpr, AmyA, AmyE, Blac,
SacB, and for S. cerevisiae Bar1, Suc2, Matα, InulA, Ggplp. Alternatively, the
enzyme variants are expressed intracellularly and the substrates are also
expressed intracellularly. For protease variants, this is done essentially as
described in patent application WO 0212543, using a fusion peptide substrate
comprising two auto-fluorescent proteins linked by the substrate amino-acid
sequence. As a further alternative, after intracellular expression of the enzyme
variants, or secretion into the periplasmic space using signal sequences such
as DsbA, PhoA, PelB, OmpA, OmpT or gIII for Escherichia coli, a permeabilisation
or lysis step releases the enzyme variants into the supernatant. The destruction
of the membrane barrier can be forced by the use of mechanical means such as
ultrasonication or a French press, or by the use of membrane-digesting enzymes
such as lysozyme. As another, further alternative, the genes encoding the enzyme
variants are expressed cell-free by the use of a suitable cell-free expression
system. For example, the S30 extract from Escherichia coli cells is used for this
purpose as described by Lesly et al. (Methods in Molecular Biology 37 (1995)
265-278).

After introduction of the vector into host cells, these cells are screened for the 
expression of enzymes with specificity for the intended target substrate. Such 
screening is typically done by separating the cells from each other, in order to 
enable the correlation of genotype and phenotype, and assaying the activity of 
each cell clone after a growth and expression period. Such separation can for 
example be done by distribution of the cells into the compartments of sample 
carriers, e.g. as described in WO 01/24933. Alternatively, the cells are separated 
by streaking on agar plates, by enclosing in a polymer such as agarose, by filling 
into capillaries, or by similar methods. 

Identification of variants with the intended specificity can be done by different 
approaches. In the case of proteases, preferably assays using peptide substrates 
essentially as described in PCT/EP03/04864 are employed. 

Regardless of the expression format, selection of enzyme variants is done under
conditions that allow identification of enzymes that preferentially recognize and
convert the target sequence. As a first alternative, such enzymes are identified
by screening for enzymes with a high affinity for the target substrate sequence.
High affinity corresponds to a low K_M, which is selected by screening at target
substrate concentrations substantially below the K_M of the first enzyme.
Preferably, the substrates that are used are linked to one or more fluorophores
that enable the detection of the modification of the substrate at concentrations
below 10 µM, preferably below 1 µM, more preferably below 100 nM, and most
preferably below 10 nM.
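
Why screening at substrate concentrations far below K_M favours high-affinity
variants can be illustrated with the Michaelis-Menten rate law. The following
is a minimal sketch; the kinetic constants are hypothetical values chosen for
illustration and are not data from this application.

```python
def michaelis_menten_rate(kcat, km, enzyme, substrate):
    """Initial rate v = kcat * [E] * [S] / (K_M + [S]).

    All concentrations must be given in the same unit (here: micromolar).
    """
    return kcat * enzyme * substrate / (km + substrate)


# Hypothetical variants with identical kcat but K_M of 1 uM versus 100 uM.
KCAT = 1.0       # per second, illustrative
ENZYME = 0.001   # micromolar, illustrative


def rate_ratio(substrate):
    """Rate of the low-K_M variant divided by that of the high-K_M variant."""
    tight = michaelis_menten_rate(KCAT, 1.0, ENZYME, substrate)
    weak = michaelis_menten_rate(KCAT, 100.0, ENZYME, substrate)
    return tight / weak


print(rate_ratio(0.01))    # 10 nM substrate, far below both K_M values
print(rate_ratio(1000.0))  # 1 mM substrate, saturating for both variants
```

At the low substrate concentration the two hypothetical variants differ in
rate by roughly the ratio of their K_M values, whereas at saturating substrate
they become almost indistinguishable, which is why a sensitive fluorescent
read-out at low substrate concentrations is useful for such a screen.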

As a second alternative, enzymes that preferentially recognize and convert the
target substrate are identified by employing two or more substrates in the assay
and screening for activity on these two or more substrates in comparison. Preferably,
the two or more substrates employed are linked to different marker molecules,
thereby enabling the detection of the modification of the two or more substrates
consecutively or in parallel. In the case of proteases, particularly preferably two
peptide substrates are employed, one peptide substrate having an arbitrarily
chosen or even partially or fully random amino-acid sequence, thereby allowing
the activity on an arbitrary substrate to be monitored, and the other peptide
substrate having an amino-acid sequence identical to or resembling the intended
target substrate sequence, thereby allowing the activity on the target
substrate to be monitored. Especially preferably, these two peptide substrates are
linked to fluorescent marker molecules, and the fluorescent properties of the two
peptide substrates are sufficiently different to distinguish both activities when
measured consecutively or in parallel. For example, a fusion protein comprising a
first autofluorescent protein, a peptide, and a second autofluorescent protein
according to patent application WO 0212543 can be used for this purpose.
Alternatively, fluorophores such as rhodamines are linked chemically to the
peptide substrates.

As a third alternative, enzymes that preferentially recognize and convert the
target substrate are identified by employing one or more substrates resembling the
target substrate together with competing substrates in high excess. Screening
with respect to activity on the substrates resembling the target substrate is then
done in the presence of the competing substrates. Enzymes having a specificity
which corresponds qualitatively to the target specificity, but having only a low
quantitative specificity, are identified as negative samples in such a screen,
whereas enzymes having a specificity which corresponds qualitatively and
quantitatively to the target specificity are identified positively. Preferably, the
one or more substrates resembling the target substrate are linked to marker
molecules, thereby enabling the detection of their modifications, whereas the
competing substrates do not carry marker molecules. The competing substrates
have arbitrarily chosen or random amino-acid sequences, thereby acting as
competitive inhibitors for the hydrolysis of the marker-carrying substrates. For
example, protein hydrolysates such as tryptone can serve as competing
substrates for engineered proteolytic enzymes according to the invention.
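
The discrimination obtained with competing substrates can be sketched with the
textbook rate law for competitive inhibition. This is a simplified
illustration, assuming the unlabelled competitor behaves as a single
competitive inhibitor with inhibition constant K_I; all numerical values and
the function names are hypothetical.

```python
def rate_with_competitor(kcat, km, ki, enzyme, substrate, competitor):
    """Rate on the labelled substrate under competitive inhibition.

    v = kcat * [E] * [S] / (K_M * (1 + [I] / K_I) + [S])
    All concentrations in the same unit (here: micromolar).
    """
    return kcat * enzyme * substrate / (km * (1.0 + competitor / ki) + substrate)


def retained_activity(km, ki, substrate, competitor):
    """Fraction of activity on the labelled substrate that survives competition.

    kcat and the enzyme concentration cancel in the ratio, so they are set to 1.
    """
    with_competitor = rate_with_competitor(1.0, km, ki, 1.0, substrate, competitor)
    without_competitor = rate_with_competitor(1.0, km, ki, 1.0, substrate, 0.0)
    return with_competitor / without_competitor


# Hypothetical enzymes with the same K_M for the labelled target substrate.
# The unspecific enzyme binds the competitor well (low K_I), the specific
# enzyme binds it poorly (high K_I).
SUBSTRATE = 0.1      # micromolar labelled substrate
COMPETITOR = 1000.0  # micromolar unlabelled competitor, in high excess

unspecific = retained_activity(km=10.0, ki=10.0, substrate=SUBSTRATE, competitor=COMPETITOR)
specific = retained_activity(km=10.0, ki=10000.0, substrate=SUBSTRATE, competitor=COMPETITOR)
print(round(unspecific, 3), round(specific, 3))
```

In this sketch the enzyme that also binds the competitor loses almost all of
its activity on the labelled substrate and would score as negative, while the
enzyme that discriminates against the competitor retains most of its activity.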

As a fourth alternative, enzymes that preferentially recognize and convert the
target substrate are identified and selected by an amplification-coupled or
growth-coupled selection step. Furthermore, the activity can be measured
intracellularly and the selection can be done by a cell sorter, such as a
fluorescence-activated cell sorter.

As a further alternative, enzymes that recognize and convert the target substrate
are identified by first selecting enzymes that preferentially bind to the target
substrate, and secondly selecting out of this subgroup of enzyme variants those
enzymes that convert the target substrate. Selection for enzymes that
preferentially bind the target substrate can be done either by selection of binders
to the target substrate or by counter-selection of enzymes that bind to other
substrates. Methods for the selection of binders or for the counter-selection of
non-binders are known in the art. Such methods typically require
phenotype-genotype coupling, which can be achieved by using surface display
expression methods. Such methods include, for example, phage or viral display,
cell surface display and in vitro display. Phage or viral display typically involves
fusion of the protein of interest to a viral/phage protein. Cell surface display,
i.e. either bacterial or eukaryotic cell display, typically involves fusion of the
protein of interest to a peptide or protein that is located at the cell surface. In
in vitro display, the protein is typically made in vitro and linked directly or
indirectly to the mRNA encoding the protein (DE 19646372).

The invention also provides for a composition or pharmaceutical composition 
comprising one or more engineered enzymes according to the first aspect of the 
invention as defined herein before. The composition may optionally comprise an 
acceptable carrier, excipient and/or auxiliary agent. 

Pharmaceutical compositions according to the invention may optionally comprise
a pharmaceutically acceptable carrier. Pharmaceutical formulations are well
known and pharmaceutical compositions may be routinely formulated by one
having ordinary skill in the art. The composition can be formulated as a solution,
suspension, emulsion, or lyophilized powder in association with a
pharmaceutically acceptable vehicle. Examples of such vehicles are water, saline,
Ringer's solution, dextrose solution, and human serum albumin. Liposomes and
nonaqueous vehicles such as fixed oils may also be used. The vehicle or
lyophilized powder may contain additives that maintain isotonicity (e.g. sodium
chloride, mannitol) and chemical stability (e.g. buffers and preservatives). The
composition is sterilized by commonly used techniques.

The pharmaceutical composition of the present invention may be administered
by any means that enables the active agent to reach the agent's site of action in
the body of a mammal. Pharmaceutical compositions may be administered
parenterally, i.e. intravenously, subcutaneously or intramuscularly.

Dosage varies depending upon known factors such as the pharmacodynamic
characteristics of the particular agent, and its mode and route of administration;
age, health, and weight of the recipient; nature and extent of symptoms; kind of
concurrent treatment; frequency of treatment; and the effect desired.

Non-pharmaceutical compositions as defined herein are research compositions,
nutritional compositions, cleaning compositions, disinfection compositions,
cosmetic compositions or compositions for personal care. Moreover, DNA sequences
coding for the engineered enzyme as defined herein before and vectors containing
said DNA sequences are also provided. Finally, transformed host cells (prokaryotic
or eukaryotic) or transgenic organisms containing such DNA sequences and/or
vectors, as well as a method utilizing such host cells or transgenic animals for
producing the engineered enzyme of the first aspect of the invention are also
contemplated.

Detailed description of the figures 

Figure 1: Three-dimensional structure of human trypsin I with the active site
residues shown in "ball-and-stick" representation and with the marked regions 
indicating potential SDR insertion sites. 

Figure 2: Alignment of the primary amino acid sequences of the human
proteases trypsin I, alpha-thrombin and enteropeptidase, all of which belong to
the structural class S1 of the serine protease family. Trypsin represents an
unspecific protease of this structural class, while alpha-thrombin and
enteropeptidase are proteases with high substrate specificity. Compared to
trypsin, several regions of insertions of three or more amino acids into the
primary sequence of alpha-thrombin and enterokinase are seen. The region marked
with (-1-) and the region marked with (-3-) are preferred SDR insertion sites. In
the tertiary structure of alpha-thrombin both regions are in the vicinity of the
substrate binding site. These regions therefore fulfil two criteria to be selected
as candidates for SDRs: firstly, they represent insertions in the specific proteases
compared to the unspecific one and, secondly, they are close to the substrate
binding site. A representation of the three-dimensional structure is given in
figure 3.

Figure 3: Three-dimensional structure of subtilisin with the active site residues 
being shown in "ball-and-stick" representation and with the numbered regions 
indicating potential SDR insertion sites. 

Figure 4: Alignment of the primary amino acid sequences of subtilisin E, furin,
PC1 and PC5, all of which belong to the structural class S8 of the serine protease
family. Subtilisin E represents an unspecific protease of this structural class,
while furin, PC1 and PC5 are proteases with high substrate specificity. Compared
to subtilisin, several regions of insertions of three or more amino acids into the
primary sequence of furin, PC1 and PC5 are seen. The regions marked with (-4-),
(-5-), (-7-), (-9-) and (-11-) are preferred SDR insertion sites. These regions
fulfil two criteria to be selected as candidates for SDRs: firstly, they
represent insertions in the specific proteases compared to the unspecific one
and, secondly, they are close to the active site residues.

Figure 5: Three-dimensional structure of beta-secretase with the active site
residues being shown in "ball-and-stick" representation and with the numbered 
regions indicating potential SDR insertion sites. 

Figure 6: Alignment of the primary amino acid sequences of pepsin, beta-secretase
and cathepsin D, all of which belong to the structural class A1 of the aspartic
protease family. Pepsin represents an unspecific protease of this structural class,
while beta-secretase and cathepsin D are proteases with high substrate specificity.
Compared to pepsin, several regions of insertions of three or more amino acids
into the primary sequence of beta-secretase and cathepsin D are seen. The regions
marked with -1- to -11- correspond to possible SDR combining sites and are
also marked in Fig. 5.

Figure 7: Illustrates the three-dimensional structure of caspase 7 with the active
site residues being shown in "ball-and-stick" representation and with the
numbered regions indicating potential SDR insertion sites.

Figure 8: Shows the primary amino acid sequence of caspase 7 as a member of
the cysteine protease class C14 family (see also SEQ ID NO: 14).

Figure 9: Schematic representation of the method according to the third aspect of
the invention.

Figure 10: Western blot analysis of trypsin expression. Supernatants of cell
cultures expressing variants of trypsin are compared to negative controls. Lane
1: molecular weight standard; lane 2: negative control; lane 3: supernatant of
variant a; lane 4: negative control; lane 5: supernatant of variant b. A primary
antibody specific to the expressed protein and a secondary antibody for
generation of the signal were used.

Figure 11: Time course of the proteolytic cleavage of a target substrate.
Supernatant of cells containing the vector with the gene for human trypsin and
that of cells containing the vector without the gene were incubated with the
peptide substrate described in the text. Cleavage of the peptide results in a
decreased read-out value. Proteolytic activity is confirmed for the positive clone.

Figure 12: Relative activity of three engineered proteolytic enzymes in
comparison with human trypsin I on two different peptide substrates. A time
course of the proteolytic digestion of the two substrates was performed and
evaluated. Substrate B was used for screening and substrate A is a closely
related sequence. Relative activity of the three variants was normalized to the
activity of human trypsin I. Variants 1 and 2 clearly show increased specificity
towards the target substrate. Variant 3, on the other hand, serves as a negative
control with activities similar to those of human trypsin I.

Figure 13: Relative specificities of trypsin and variants of engineered proteolytic
enzymes with one or two SDRs, respectively. Activity of the proteases was
determined in the presence and absence of competitor substrate, i.e. peptone at
a concentration of 10 mg/ml. Time courses for the proteolytic cleavage were
recorded and the time constants k determined. The ratios between the time
constants with and without competitor were formed and represent a quantitative
measure for the specificity of the protease. The ratios were normalized to
trypsin. The specificity of the variant containing two SDRs is 2.5 fold higher than
that of the variant with SDR2 alone.

Figure 14: Shows the relative specificities of protease variants in the absence and
presence of competitor substrate. The protease variants containing two inserts
with different sequences and the non-modified scaffold human trypsin I were
expressed in a suitable host. Activity of the protease variants was determined as
the cleavage rate of a peptide with the desired target sequence of TNF-alpha in
the absence and presence of competitor substrate. Specificity is expressed as the
ratio of cleavage rates in the presence and absence of competitor.

Figure 15: The figure shows the reduction of cytotoxicity induced by human
TNF-alpha when incubating the human TNF-alpha with concentrated supernatant from
cultures expressing the inventive engineered proteolytic enzymes being specific
for human TNF-alpha. This indicates the efficacy of the inventive engineered
proteolytic enzymes.

Figure 16: The figure shows the reduction of cytotoxicity induced by human
TNF-alpha when incubating the human TNF-alpha with different concentrations of
purified inventive engineered proteolytic enzyme being specific for human
TNF-alpha. Variant g comprises SEQ ID NO: 72 as SDR1 and SEQ ID NO: 73 as SDR2.
This indicates the efficacy of the inventive engineered proteolytic enzymes.

Figure 17: The figure compares the activity of inventive engineered proteolytic
enzymes being specific for human TNF-alpha with the activity of human trypsin I
on two protein substrates: (a) human TNF-alpha; (b) mixture of human serum
proteins. This indicates the safety of the inventive engineered proteolytic
enzymes. Variant x corresponds to SEQ ID NO: 75 comprising the SDRs according
to SEQ ID NO: 89 (SDR1) and 95 (SDR2). Variants xi and xii correspond to
derivatives thereof comprising the same SDR sequences.

Figure 18: Specific hydrolysis of human VEGF by an engineered proteolytic
enzyme derived from human trypsin.

Examples

In the following examples, materials and methods of the present invention are
provided, including the determination of catalytic properties of enzymes obtained
by the method. It should be understood that these examples are for illustrative
purposes only and are not to be construed as limiting this invention in any
manner. All publications, patents, and patent applications cited herein are hereby
incorporated by reference in their entirety for all purposes.

In the experimental examples described below, standard techniques of 
recombinant DNA technology were used that were described in various 
publications, e.g. Sambrook et al. (1989), Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, or Ausubel et al. (1987), Current 
Protocols in Molecular Biology 1987-1988, Wiley Interscience. Unless otherwise 
indicated, restriction enzymes, polymerases and other enzymes as well as DNA 
purification kits were used according to the manufacturers' specifications.

Example I: Identification of SDR sites in human trypsin

Insertion sites for SDRs have been identified in the serine protease human
trypsin I (structural class S1) by comparison with members of the same
structural class having a higher sequence specificity. Trypsin represents a
member with low substrate specificity, as it requires only an arginine or lysine
residue at the P1 position. On the other hand, thrombin, tissue-type plasminogen
activator and enterokinase all have a high specificity towards their substrate
sequences, i.e. (L/I/V/F)XPR^NA, CPGR^VVGG and DDDK^, respectively. The
primary sequences and tertiary structures of these and further S1 serine
proteases have been aligned in order to determine regions of low and high
sequence and structure homology and especially regions that correspond to
insertions in the sequences of the more specific proteases (Figure 2). Several
regions of insertions of three or more amino acids representing potential
SDR sites have been identified as indicated in Figure 1. These regions were
chosen as target sites for the insertion of SDRs in the examples below, e.g. SDR1
(region one in figure 2, after amino acid 42 according to SEQ ID NO:1) with a
length of six and SDR2 (region three in figure 2, after amino acid 123 according
to SEQ ID NO:1) with a length of five amino acids, respectively.

Example II: Molecular cloning of the human trypsin I gene to be used as scaffold
protein and expression of the mature protease in B. subtilis

The gene encoding the unspecific protease human trypsinogen I was cloned into
the vector pUC18. Cloning was done as follows: the coding sequence of the
protein was amplified by PCR using primers that introduced a KpnI site at the 5'
end and a BamHI site at the 3' end. This PCR fragment was cloned into the
appropriate sites of the vector pUC18. Identity was confirmed by sequencing.
After sequencing, the coding sequence of the mature protein was amplified by
PCR using primers that introduced different BglI sites at the 5' end and the 3'
end.

This PCR fragment was cloned into the appropriate sites of an E. coli - B. subtilis
shuttle vector. The vector contains a pMB1 origin for amplification in E. coli, a
neomycin resistance marker for selection in E. coli, as well as a P43 promoter for
the constitutive expression in B. subtilis. An 87 bp fragment that contains the
leader sequence encoding the signal peptide from the sacB gene of B. subtilis
was introduced behind the P43 promoter. Different BglI restriction sites serve as
insertion sites for heterologous genes to be expressed.

Expression of human trypsin I was confirmed by measurement of the proteolytic
activity in the supernatant of cells containing the vector with the gene in
comparison to a negative control. A peptide including an arginine cleavage site was
chosen as a substrate. The peptide was N-terminally biotinylated and labeled with a
fluorophore at the C-terminus. After incubation of the peptide with culture
supernatant, streptavidin was added. Uncleaved peptides associate with
streptavidin and lead to a high read-out value, while cleavage results in low
read-out values. Figure 11 shows the time course of the proteolytic digestion by
supernatant of B. subtilis cells containing the vector with the trypsin I gene in
comparison to B. subtilis cells containing the vector without the trypsin I gene
(negative control). As a further confirmation of expression of the protease,
supernatants of cells containing the vector with the gene and control cells were
analyzed by polyacrylamide gel electrophoresis and subsequent Western blot using an
antibody specific to the target protease. The procedure was performed according
to standard methods (Sambrook, J.F.; Fritsch, E.F.; Maniatis, T.; Cold Spring
Harbor Laboratory Press, Second Edition, 1989, New York). Figure 10 confirms
expression of the protein only in the cells harbouring the vector with the gene for
trypsin.

Example III: Providing a scaffold protein

In this example, human trypsin I was used as the scaffold protein. The gene was 
either used in its natural form, or, alternatively, was modified to result in a 
scaffold protein with increased catalytic activity or further improved 
characteristics. 

The modification was done by random modification of the gene, followed by
expression of the enzyme and subsequent selection for increased activity. First,
the gene was PCR amplified under error-prone conditions, essentially as
described by Cadwell, R.C. and Joyce, G.F. (PCR Methods Appl. 2 (1992) 28-33).
Error-prone PCR was done using 30 pmol of each primer, 20 nmol dGTP and
dATP, 100 nmol dCTP and dTTP, 20 fmol template, and 5 U Taq DNA polymerase
in 10 mM Tris-HCl pH 7.6, 50 mM KCl, 7 mM MgCl2, 0.5 mM MnCl2, 0.01 %
gelatin for 20 cycles of 1 min at 94 °C, 1 min at 65 °C and 1 min at 72 °C. The
resulting DNA library was purified using the QIAquick PCR Purification Kit
following the suppliers' instructions. The PCR product was digested with the
restriction enzyme BglI and purified. Afterwards, the PCR product was ligated
into the E. coli - B. subtilis shuttle vector described above, which had been digested
with BglI and dephosphorylated. The ligation products were transformed into E.
coli, amplified in LB, and the plasmids were purified using the Qiagen Plasmid
Purification Kit following the suppliers' instructions. Resulting plasmids were
transformed into B. subtilis cells.

Alternatively, or in addition to random mutagenesis, variants of the gene were
statistically recombined at homologous positions by use of the Recombination
Chain Reaction, essentially as described in WO 0134835. PCR products of the
genes encoding the protease variants were purified using the QIAquick PCR
Purification Kit following the suppliers' instructions, checked for correct size by
agarose gel electrophoresis and mixed together in equimolar amounts. 80 µg of
this PCR mix in 150 mM Tris-HCl pH 7.6, 6.6 mM MgCl2 were heated for 5 min at
94 °C and subsequently cooled down to 37 °C at 0.05 °C/s in order to re-anneal
strands and thereby produce heteroduplices in a stochastic manner. Then, 2.5 U
Exonuclease III per µg DNA were added and incubated for 20, 40 or 60 min at 37
°C in order to digest different lengths from both 3' ends of the heteroduplices.
The partly digested PCR products were refilled with 0.6 U Pfu polymerase per µg
DNA by incubating for 15 min at 72 °C in 0.17 mM dNTPs and Pfu polymerase
buffer according to the suppliers' instructions. After performing a single PCR
cycle, the resulting DNA was purified using the QIAquick PCR Purification Kit
following the suppliers' instructions, digested with BglI and ligated into the
linearized vector. The ligation products were transformed into E. coli, amplified in
LB containing ampicillin as marker, and the plasmids were purified using the
Qiagen Plasmid Purification Kit following the suppliers' instructions. Resulting
plasmids were transformed into B. subtilis cells.
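
The per-microgram enzyme amounts and the cooling ramp quoted for the Recombination Chain Reaction can be turned into absolute numbers with a few lines of arithmetic. The following is only a sketch: it reads the DNA amount as 80 µg, and all variable names are illustrative rather than taken from the application.

```python
# Quantities as quoted in the text; names are illustrative only.
dna_ug = 80.0               # µg of mixed PCR product (assumed reading of the DNA amount)
exo_u_per_ug = 2.5          # U Exonuclease III per µg DNA
pfu_u_per_ug = 0.6          # U Pfu polymerase per µg DNA
t_start_c, t_end_c = 94.0, 37.0
cool_rate_c_per_s = 0.05    # cooling rate in °C per second

exo_units = dna_ug * exo_u_per_ug
pfu_units = dna_ug * pfu_u_per_ug
cool_seconds = (t_start_c - t_end_c) / cool_rate_c_per_s

print(f"Exonuclease III: {exo_units:.0f} U")
print(f"Pfu polymerase:  {pfu_units:.0f} U")
print(f"Cooling ramp:    {cool_seconds:.0f} s (about {cool_seconds / 60:.0f} min)")
```

Under these assumptions the re-annealing ramp from 94 °C to 37 °C takes about 19 minutes, which is what makes heteroduplex formation slow enough to be stochastic.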

Example IV: Insertion of SDRs into the protein scaffold of human trypsin I and
generation of an engineered proteolytic enzyme with specificity for a peptide
substrate having the sequence KKWLGRVPGGPV.

In order to create insertion sites for SDRs in human trypsin I, two pairs of
different restriction sites were introduced into the gene at sites that were
identified as potential SDR sites (see Example I above) without changing the
amino acid sequence. The insertion of the restriction sites was done by overlap
extension PCR. Primers restr1 and restr2 were used for the introduction of SacII
and BamHI restriction sites; restr3 and restr4 were used for the introduction of
KpnI and NheI restriction sites. The sequences of the primers were as follows:

Binding site for restr1 and restr2 and the corresponding amino acid sequence
(SEQ ID NO: 54):

5'-GGTGGTA TCAGCAG GCCACTGCTACAAGTCCC GCATCCA GGT-3'
VVSAGHCYKSRIQ

Forward primer restr1 (SEQ ID NO: 56):

5'-GGTGGTAT CCGCGG GCCACTGCTACAAGTCC CGGATCCA GGT-3'

Reverse primer restr2 (SEQ ID NO: 57):

5'-ACC TGGATCC GGGACTTGTAGCAGTGG CCCGCGG ATACCACC-3'

Binding site for restr3 and restr4 and the corresponding amino acid sequence
(SEQ ID NO: 58):

5'-CCACT GGCACGA AGTGCCTCATCTCTGGCTGGGGCAACACT GCGAGC TCT-3'
TGTKCLISGWGNTASS

Forward primer restr3 (SEQ ID NO: 60):

5'-CCACT GGCACGA AGTGCCTCATCTCTGGCTGGGGCAACACT GCGAGC TCT-3'

Reverse primer restr4 (SEQ ID NO: 61):

5'-AGAGCTAGCAGTGTTGCCCCAGCCAGAGATGAGGCACTTGGTACCAGTGG-3'
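
The statement that the restriction sites are introduced without changing the amino acid sequence can be checked directly from the printed sequences. The sketch below assumes the standard genetic code and the reading frames implied by the amino acid lines given with the binding sites. The helper names (`revcomp`, `translate`) are illustrative. Because restr4 is a reverse primer, its reverse complement is used to inspect the coding strand around the KpnI/NheI sites.

```python
# Standard genetic code, built from the usual TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def revcomp(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def translate(seq, frame):
    """Translate seq starting at offset `frame`, ignoring a trailing partial codon."""
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons)

# Sequences as printed in the text, with the spacing removed.
site12 = "GGTGGTATCAGCAGGCCACTGCTACAAGTCCCGCATCCAGGT"
restr1 = "GGTGGTATCCGCGGGCCACTGCTACAAGTCCCGGATCCAGGT"
restr2 = "ACCTGGATCCGGGACTTGTAGCAGTGGCCCGCGGATACCACC"
site34 = "CCACTGGCACGAAGTGCCTCATCTCTGGCTGGGGCAACACTGCGAGCTCT"
restr4 = "AGAGCTAGCAGTGTTGCCCCAGCCAGAGATGAGGCACTTGGTACCAGTGG"
restr4_coding = revcomp(restr4)  # coding-strand view of the reverse primer

print(translate(site12, 1), translate(restr1, 1))          # wild type vs. SacII/BamHI primer
print(translate(site34, 2), translate(restr4_coding, 2))   # wild type vs. KpnI/NheI primer
print("CCGCGG" in restr1, "GGATCC" in restr1)              # SacII and BamHI sites present
print("GGTACC" in restr4_coding, "GCTAGC" in restr4_coding)  # KpnI and NheI sites present
print(revcomp(restr1) == restr2)                           # primer pair is complementary
```

Both translations agree with the amino acid lines printed under the binding sites, so the mutations carried by the primers are silent.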



In a first overlap extension PCR, the SacII/BamHI sites were introduced,
enabling the insertion of SDR1, and in a second overlap extension PCR the KpnI/NheI
sites, enabling the insertion of SDR2. The product of the overlap extension PCR
was amplified using primers pUC-forward and pUC-reverse. The sequences of
pUC-forward and pUC-reverse are as follows:

pUC-forward (SEQ ID NO:62): 5'-GGSSIAQCCCACCACCATGAATCCACTCCT-3'
pUC-reverse (SEQ ID NO:63): 5'-CGGGATCCGGTATAGAGACTGAAGAGATAC-3'

The restriction sites generated thereby were subsequently used to insert defined
or random oligonucleotides into the SDR1 and SDR2 insertion sites by standard
restriction and ligation methods. Typically, two complementary synthetic
5'-phosphorylated oligonucleotides were annealed and ligated into a vector carrying
the modified human trypsin I gene that was cleaved with the respective
restriction enzymes. Oligonucleotides encoding SDR1 were inserted via the
SacII/BamHI sites whereas oligonucleotides encoding SDR2 were inserted via the
KpnI/NheI sites. For each insertion an oligonucleotide pair according to the
following general sequences was used ([P] indicating 5'-phosphorylation, N and X
indicating any nucleotide or amino acid residue, respectively):
oligox-SDR1f (SEQ ID NO: 64):

5'-[P]-GGGCCACTGCTAC NNNNNN[...]

oligox-SDR1r (SEQ ID NO: 66):

3'-CGCCCGGTGACGAT GNNNNNN[...]-[P]-5'

GHCYXXXXXXKS

oligox-SDR2f (SEQ ID NO: 67):

5'-[P]-CAAGTGCCTCATCTCTGGCTGGGGCAA CNNN[...]

oligox-SDR2r (SEQ ID NO: 69):

3'-CATGGTTCACGGAGTAGAGACCGACCCCGTT GNNNNNNNNN[...]-5'

KCLISGWGNXXXXXT

As an alternative to the above method, a PCR based method was used for the
integration of random sequences into the SDR1 and SDR2 insertion sites in the
modified human trypsin I. For each SDR, one primer was used where the SDR
region is fully randomized. Sequences of the primers were as follows (N =
A/C/G/T, B = C/G/T, V = A/C/G):

Primer SDR1-mutnnb-forward (SEQ ID NO:70):

5'-TGGTATCCGCGGGCCACTGCTACNNBNNBNNBNNBNNBNNBAAGTCCCGGATCC-3'

Primer SDR2-mutnnb-reverse (SEQ ID NO:71):

5'-GGCGCCAGAGCTAGCAGTVNNVNNVN[...]

The codon NNB, or VNN in the reverse strand, allows all 20 amino acids to be
encoded, but reduces the probability of encoding a stop codon from 0.047 to 0.021.
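
The two probabilities quoted above follow from enumerating the codons that each degenerate pattern can produce. The sketch below assumes the standard genetic code and equal representation of every base allowed at a position. The function names are illustrative.

```python
from itertools import product

# IUPAC ambiguity codes used in the primers.
IUPAC = {"N": "ACGT", "B": "CGT", "V": "ACG"}

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def revcomp(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def expand(pattern):
    """All concrete codons matching a degenerate three-letter pattern."""
    return ["".join(p) for p in product(*(IUPAC[s] for s in pattern))]

def stop_fraction(pattern, on_reverse_strand=False):
    """Fraction of codons that are stop codons, read on the coding strand."""
    codons = expand(pattern)
    if on_reverse_strand:
        codons = [revcomp(c) for c in codons]
    return sum(CODON_TABLE[c] == "*" for c in codons) / len(codons)

def amino_acids(pattern):
    """Set of amino acids a degenerate codon can encode (stop excluded)."""
    return {CODON_TABLE[c] for c in expand(pattern)} - {"*"}

for pattern, rev in (("NNN", False), ("NNB", False), ("VNN", True)):
    print(pattern, round(stop_fraction(pattern, rev), 3))
print(len(amino_acids("NNB")), "amino acids encoded by NNB")
```

NNN gives 3 stop codons out of 64 (0.047). NNB excludes A at the third position, so only TAG remains among 48 codons (0.021), and all 20 amino acids are still represented.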

As a further alternative, after identification of SDRs that lead to increased 
specificity, these SDRs were used as templates for further randomization. 
Thereby, random peptide sequences were inserted that were partially 
randomized at each position and partially identical at each position to the original 
sequence. 

As an example, random peptide sequences that have in approximately 1 of 3
cases the template amino acid residue and in approximately 2 of 3 cases any
other amino acid residue at each position were inserted into the two SDR
insertion sites of the modified human trypsin I. For this purpose, primers that
contain at each nucleotide position of the SDR approximately 70% of the
template bases and 30% of a mixture of the three other bases were used.

With each primer pair a PCR was performed under standard conditions using the
human trypsin I gene as template. The resulting DNA was purified using the
QIAquick PCR Purification Kit following the suppliers' instructions and digested
with SacII and NheI. After digestion the DNA was purified and ligated into the
SacII and NheI digested and dephosphorylated vector. The ligation products
were transformed into E. coli, amplified in LB containing the respective marker,
and the plasmids were purified using the Qiagen Plasmid Purification Kit following
the suppliers' instructions. Resulting plasmids were transformed into B. subtilis
cells. These cells were then separated to single cells, grown to clones, and after
expression of the protease gene screened for proteolytic activity.
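
The link between the 70%/30% nucleotide doping and the "approximately 1 of 3" figure for the template residue can be made explicit. The sketch below works at the codon level only and assumes that the three positions of a codon are doped independently. The six-codon SDR length is taken from the six NNB codons of the SDR1 primer. All names are illustrative.

```python
from math import comb

p_template = 0.70   # probability that a doped position carries the template base
codon_len = 3
sdr_codons = 6      # assumed SDR length, taken from the six NNB codons of the SDR1 primer

# A codon is guaranteed to encode the template residue only if all three bases are kept.
p_codon_unchanged = p_template ** codon_len

# Distribution of the number of mutated bases per codon (binomial).
mutations_per_codon = {
    k: comb(codon_len, k) * (1 - p_template) ** k * p_template ** (codon_len - k)
    for k in range(codon_len + 1)
}

expected_template_residues = sdr_codons * p_codon_unchanged

print(f"P(codon unchanged) = {p_codon_unchanged:.3f}")
for k, p in mutations_per_codon.items():
    print(f"P({k} mutated bases) = {p:.3f}")
print(f"Expected template residues per SDR = {expected_template_residues:.2f}")
```

The codon-level value of about 0.34 is a lower bound for retaining the template amino acid, because some single-base changes are synonymous.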

The following substrates were employed for screening for proteolytic activity
(SEQ ID NOs:76 and 77):

Protease variants were screened on substrate B at complexities of 10 B variants
by confocal fluorescence spectroscopy. The substrate was a peptide biotinylated
at the N-terminus and fluorescently labeled at the C-terminus. After incubation of
the peptide with supernatant of cells expressing different variants of the
protease, streptavidin was added and the samples were analysed by confocal
fluorimetry. The low concentration of the peptide (20 nM) leads to preferential
cleavage by proteases with a high kcat/KM value, i.e. proteases with high
specificity towards the target sequence.
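
Why a low substrate concentration favours variants with a high kcat/KM can be illustrated with the Michaelis-Menten rate law. In the sketch below, only the 20 nM substrate concentration comes from the text. The enzyme concentration and the kinetic constants of the two variants are hypothetical values chosen for illustration.

```python
def mm_rate(kcat, km, enzyme, substrate):
    """Michaelis-Menten rate v = kcat * [E] * [S] / (KM + [S])."""
    return kcat * enzyme * substrate / (km + substrate)

def low_substrate_rate(kcat, km, enzyme, substrate):
    """Limiting form for [S] << KM: v ~ (kcat / KM) * [E] * [S]."""
    return (kcat / km) * enzyme * substrate

substrate = 20e-9   # 20 nM peptide, as stated in the text
enzyme = 1e-9       # hypothetical enzyme concentration (M)

# Hypothetical variants: (kcat in 1/s, KM in M).
variants = {
    "fast but weakly binding": (10.0, 100e-6),   # kcat/KM = 1e5 per M per s
    "slow but tightly binding": (1.0, 1e-6),     # kcat/KM = 1e6 per M per s
}

rates = {}
for name, (kcat, km) in variants.items():
    rates[name] = mm_rate(kcat, km, enzyme, substrate)
    approx = low_substrate_rate(kcat, km, enzyme, substrate)
    print(f"{name}: v = {rates[name]:.3e} M/s, (kcat/KM)[E][S] = {approx:.3e} M/s")
```

With these hypothetical numbers the variant with the tenfold lower kcat is still cleaved roughly ten times faster at 20 nM, because at this concentration the rate is governed by kcat/KM rather than by kcat alone.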

Variants selected in the screening procedure were further evaluated for their
specificity towards substrate B and closely related substrate A by measuring time
courses of the proteolytic digestion and determining the rate constants, which are
proportional to the kcat/KM values. Clearly, compared to the human trypsin that
was used as scaffold protein, the specific activity of variants 1 and 2
(SEQ ID NOs: 2 and 3, respectively) is shifted towards substrate B. Variant 3 (SEQ ID
NO: 4), on the other hand, serves as a negative control with activities similar to
those of human trypsin I. Sequencing of the genes of the three variants revealed the
following amino acid sequences in the SDRs.

Table 2: Sequences of the two SDRs in three different variants selected for
specific hydrolysis of substrate B (SEQ ID NOs: 78-83).

In a further experiment a pool of variants containing different numbers of SDRs
per gene was screened for increased specificity using a mixture of the defined
substrate and peptone as a competing substrate. Variants containing one or two
SDRs per gene were analyzed further. As a measure of specificity, the
activity in the peptide cleavage assay was compared with and without the
presence of the competing substrate. The concentration of the competing
substrate was 10 mg/ml. Under these conditions, unspecific proteases show,
compared to specific proteases, a stronger decrease in activity with increasing
competitor concentrations (range between 0 and 100 mg/ml). The ratio of
proteolytic activity with and without competing substrate is a quantitative measure
of the specificity of the proteases. Figure 9 shows the relative activities with and
without competing substrate. Human trypsin I that was used as the scaffold
protein and two variants, one containing only SDR2, and one containing both
SDRs, were compared. The specificity of the variant with both SDRs is higher by a
factor of 2.5 than that of the variant with SDR2 only, confirming that there is a
direct relation between the number of SDRs and the quantitative specificity of
resulting engineered proteolytic enzymes.
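
The specificity measure used here is a simple ratio of activities with and without competitor. The sketch below shows how such a ratio and the fold difference between two variants would be computed. The activity values are hypothetical placeholders, not data from Figure 9, and were chosen only so that the fold difference between the two variants matches the factor of 2.5 stated in the text.

```python
def specificity(activity_with_competitor, activity_without_competitor):
    """Ratio of cleavage activity with competing substrate to activity without it."""
    return activity_with_competitor / activity_without_competitor

# Hypothetical relative activities: (with competitor, without competitor).
activities = {
    "human trypsin I (scaffold)": (0.10, 1.00),
    "variant with SDR2 only": (0.20, 1.00),
    "variant with SDR1 and SDR2": (0.50, 1.00),
}

ratios = {name: specificity(w, wo) for name, (w, wo) in activities.items()}
for name, ratio in ratios.items():
    print(f"{name}: specificity ratio = {ratio:.2f}")

fold = ratios["variant with SDR1 and SDR2"] / ratios["variant with SDR2 only"]
print(f"Two SDRs vs. SDR2 only: {fold:.1f}-fold")
```

A ratio close to 1 means the competitor barely affects cleavage of the target peptide, whereas a ratio near 0 indicates an unspecific protease that is largely diverted to the competing substrate.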

Example V: Generation of an engineered proteolytic enzyme that specifically 
inactivates human TNF-alpha 

Human trypsin alpha I or a derivative comprising one or more of the following 
amino acid substitutions E56G; R78W; Y131F; A146T; C183R was used as 
protein scaffold for the generation of an engineered proteolytic enzyme with high 
specificity towards human TNF-alpha. The identification of SDR sites in human 
trypsin I or derivatives thereof was done as described above. Two insertion sites 
within the scaffold were chosen for SDRs. The protease variants containing two
inserts with different sequences and also the human trypsin I itself with no
inserts were expressed in Bacillus subtilis cells. The variant protease cells were
separated to single cell clones and the protease expressing variants were 
screened for proteolytic activity on peptides with the desired target sequence of 
TNF-alpha. The activity of the protease variants was determined as the cleavage 
rate of a peptide with the desired target sequence of TNF-alpha in the absence 
and presence of competitor substrate. The specificity is expressed as the ratio of 
cleavage rates in the presence and absence of competitor (Fig. 14). 

Table 3: Relative specificity of variants of engineered proteolytic enzymes with
different SDR sequences in absence and presence of competitor substrate (SEQ
ID NOs:84-95).

The antagonistic effect of three inventive protease variants on human TNF-alpha
is shown in Figure 15. By the use of the variants, the induction of apoptosis is
almost completely eliminated, indicating the anti-inflammatory efficacy of the
inventive proteases in initiating TNF-alpha breakdown. TNF-alpha was
incubated with concentrated supernatant from cultures expressing the variants i
to iii for 2 hours. The resulting TNF-alpha was incubated with non-modified
cells for 4 hours. The effect of the remaining TNF-alpha activity was determined
as the extent of apoptosis induction by detection of activated caspase-3 as a
marker for apoptotic cells. For the controls, either no protease was added with
the human TNF-alpha (dead cells) or buffer was used instead of human TNF-alpha
(live cells). An analogous experiment is shown in Figure 16
using purified variant xiii. TNF-alpha was incubated with different concentrations
of the purified inventive protease variant.

To demonstrate the specificity of the inventive protease variants, proteins from 
human blood serum or purified human TNF-alpha have been incubated with 
human trypsin I or the inventive engineered proteolytic enzyme variants, 
respectively. Here, variant x corresponds to SEQ ID NO: 75 comprising the same
SDRs as variant f, i.e. SDRs according to SEQ ID NO: 89 (SDR1) and 95 (SDR2).
Variants xi and xii correspond to derivatives thereof comprising the same SDR
sequences. Remaining intact protein was determined as a function of time.
While the variants as well as human trypsin I digest human TNF-alpha, only 
trypsin shows activity on serum protein (Figure 17 a and b). This demonstrates 
the high TNF-alpha specificity of the inventive proteolytic enzymes and indicates 
their safety and accordingly their low side effects for therapeutic use. 

Example VI: Generation of an engineered proteolytic enzyme that specifically
hydrolyzes human VEGF.

Human trypsin I was used as protein scaffold for the generation of an engineered
proteolytic enzyme with high specificity towards human VEGF. The identification
of SDR sites in human trypsin I was done as described above. Two insertion sites
within the scaffold were chosen for SDRs. The protease variants containing two
inserts with different sequences were expressed in Bacillus subtilis cells. The
variant protease cells were separated to single cell clones and the protease
expressing variants were screened as described above. The activity of the
protease variants was determined as the rate of VEGF cleavage. 4 ng of
recombinant human VEGF165 was incubated with 0.18 pg of purified protease in
PBS / pH 7.4 at room temperature. Aliquots were taken at the indicated time
points and analysed on a polyacrylamide gel. The extent of cleavage was
quantified by densitometric analysis of the bands. The activity is plotted over
incubation time in Figure 18. Specific cleavage was checked by further SDS
polyacrylamide gel analyses.
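
Densitometric band intensities from such a time course can be converted into a cleavage rate. The following sketch assumes pseudo-first-order decay of the intact VEGF band, which the text does not state. The time points and intensities are synthetic placeholders generated from a known rate constant, not measured data, and the function names are illustrative.

```python
import math

def fraction_intact(intensities):
    """Band intensity at each time point relative to the first (t = 0) lane."""
    return [value / intensities[0] for value in intensities]

def first_order_rate(times, fractions):
    """Least-squares slope of ln(fraction intact) versus time, returned as a positive rate."""
    ys = [math.log(f) for f in fractions]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(ys) / n
    slope = (sum((t - mean_t) * (y - mean_y) for t, y in zip(times, ys))
             / sum((t - mean_t) ** 2 for t in times))
    return -slope

# Synthetic example: intensities generated with k = 0.05 per minute.
times_min = [0, 10, 20, 40, 60]
band_intensity = [1000.0 * math.exp(-0.05 * t) for t in times_min]

fractions = fraction_intact(band_intensity)
extent_of_cleavage = [1.0 - f for f in fractions]
k_obs = first_order_rate(times_min, fractions)

print([round(x, 2) for x in extent_of_cleavage])
print(f"k_obs = {k_obs:.3f} per min")
```

With real gels the fit would be noisier, and a different kinetic model may be more appropriate if the protease is not in large excess or loses activity during the incubation.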

Claims

1. Use of a protease with defined specificity for a target substrate for preparing a
medicament for the treatment of a specific disease related to said target
substrate.

2. The use according to claim 1, wherein

(I) the protease hydrolyzes the target substrate and thereby

(i) eliminates or reduces one or more biological activities or physico- 
chemical properties or pharmacological properties of the target 
protein, and/or 

(ii) activates or increases one or more biological activities or physico- 
chemical properties or pharmacological properties of the target 
protein, and/or 

(iii) adds one or more biological activities or physico-chemical properties 
or pharmacological properties to the target protein;

and/or 

(II) the target substrate hydrolyzed by the protease is a soluble protein, in 
particular a cytokine, such as the TNF-superfamily proteins, interleukins,
interferons, chemokines and growth factors; a hormone; a toxin; an enzyme,
such as oxidoreductases, transferases, hydrolases, lyases, isomerases and
ligases; a structural protein, such as collagens; or an immunoglobulin; or a 
membrane associated protein, in particular a single pass transmembrane protein; 
a multipass transmembrane protein, such as G-protein coupled receptors, ion 
channels and transporters; a lipid-anchored membrane protein or a GPI- 
anchored membrane protein. 

3. The use according to claim 1 or 2, wherein the target substrate hydrolyzed by 
the protease is a TNF-superfamily protein. 

4. The use according to claim 3, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human tumor necrosis
factor-alpha (hTNF-α, SEQ ID NO: 96) or related molecules of the same structure
class, preferably the peptide bonds between positions 31/32, 32/33, 44/45,
45/46, 87/88, 128/129, 130/131, 140/141 and/or 141/142, most preferably
between positions 31/32, 32/33 and/or 45/46 of hTNF-α or between analogous
positions in said related molecules; and/or

(ii) the medicament is suitable for the treatment of rheumatoid arthritis,
inflammatory bowel diseases, psoriasis, Crohn's disease, Ulcerative colitis,
diabetes type II, classical Hodgkin's Lymphoma (cHL), Graves' disease,
Hashimoto's thyroiditis, Sjogren's syndrome, systemic lupus erythematosus,
multiple sclerosis, Systemic inflammatory response syndrome (SIRS), multiple
organ dysfunction syndrome (MODS), eosinophilia, neurodegenerative disease,
stroke, closed head injury, encephalitis, CNS disorders, asthma, rheumatoid
arthritis, sepsis, vasodilation, intravascular coagulation, multiple organ failure or
other diseases connected with hTNF-α.

5. The use according to claim 4, wherein the protease is derived from a serine 
protease of the structural class S1, and preferably wherein the protease is 
derived from human trypsin I (SEQ ID NO:1). 

6. The use according to claim 4, wherein the protease has the sequence shown
in SEQ ID NO: 74 or 75 or a derivative thereof and is capable of hydrolysing
hTNF-α at positions 31/32 and/or 32/33.

7. The use according to claim 3, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human Tumor 
necrosis factor ligand superfamily member 5 (CD40-L) or related molecules of 
the same structural class, preferably the peptide bonds between positions 
117/118, 133/134, 145/146, 165/166, 200/201, 201/202, 207/208, 216/217 
and/or 243/244, most preferred between positions 133/134, 165/166, 201/202 
and/or 216/217 of CD40-L (SEQ ID NO:143), or between analogous positions in 
related molecules; and/or 

(ii) the medicament is suitable for the treatment of systemic lupus 
erythematosus, classical Hodgkin's Lymphoma (cHL) or other diseases connected 
with CD40-L.

8. The use according to claim 3, wherein 

(i) the engineered protease is capable of hydrolysing peptide bonds in human 
Macrophage migration inhibitory factor (hMIF) or related molecules of the same 
structural class, preferably the peptide bonds between positions 16/17, 44/45,
66/67, 73/74, 77/78, 88/89, 92/93 and/or 100/101, most preferred between 
positions 16/17 and/or 92/93 of hMIF (SEQ ID NO: 109) or between analogous 
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of inflammatory diseases, or 
other diseases connected with hMIF. 

9. The use according to claim 1 or 2, wherein the target substrate hydrolyzed by 
the protease is an interleukin.

10. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in Interleukin-1 beta
(IL-1β) or related molecules of the same structural class, preferably the peptide
bonds between positions 24/25, 35/36, 46/47, 54/55, 74/75, 75/76, 76/77,
77/78, 86/87, 88/89, 93/94, 94/95, 97/98 and/or 150/151, most preferred
between positions 35/36, 75/76, 76/77, 88/89, 93/94, 94/95 and/or 150/151 of
IL-1β (SEQ ID NO:112), or between analogous positions in related molecules;
and/or

(ii) the medicament is suitable for the treatment of diabetes, brain inflammation
in cancer, arthritis, autoimmune and inflammatory diseases or other diseases
connected with IL-1β.

11. The use according to claim 9, wherein

(i) the protease is capable of hydrolysing peptide bonds in human Interleukin 2
(hIL-2) or related molecules of the same structural class, preferably the peptide
bonds between positions 20/21, 32/33, 38/39, 43/44, 45/46, 48/49, 49/50,
54/55, 64/65, 76/77, 83/84, 84/85, 107/108, 109/110 and/or 120/121, most
preferred between positions 109/110 of hIL-2 (SEQ ID NO:99) or between
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of T-cell leukemia, hairy cell
leukemia, Crohn's disease, Ulcerative colitis, Graves' disease, Hashimoto's
thyroiditis, Sjogren's syndrome, systemic lupus erythematosus, multiple
sclerosis, asthma and chronic obstructive pulmonary disease, classical Hodgkin's
Lymphoma (cHL) or other diseases connected with hIL-2.

12. The use according to claim 9, wherein

(i) the protease is capable of hydrolysing peptide bonds in human Interleukin 3
(hIL-3) or related molecules of the same structural class, preferably peptide
bonds between positions 21/22, 28/29, 36/37, 44/45, 46/47, 51/52, 63/64,
66/67, 79/80, 94/95, 101/102, 108/109 and/or 109/110, most preferred
between positions 21/22, 28/29, 46/47, 63/64, 66/67, 79/80 and/or 101/102 of
hIL-3 (SEQ ID NO:148), or between analogous positions in related molecules;
and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), eosinophilia or other diseases connected with hIL-3.

13. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin 4 (hIL-4) or related molecules of the same structural
class, preferably the peptide bonds between positions 4/5, 12/13,
31/32, 37/38, 61/62, 62/63, 64/65, 91/92, 102/103, 121/122
and/or 126/127, most preferred between positions 4/5, 61/62,
62/63, 64/65 and/or 121/122 of hIL-4 (SEQ ID NO:118), or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), Graves' disease, Hashimoto's thyroiditis,
Sjogren's syndrome, Asthma, chronic obstructive pulmonary
disease, allergic inflammatory reactions or other diseases connected
with hIL-4.

14. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin-5 (hIL-5) or related molecules of the same structural
class, preferably the peptide bonds between positions 12/13, 32/33,
67/68, 76/77, 77/78, 80/81, 83/84, 84/85, 85/86, 90/91, 91/92,
92/93 and/or 98/99, most preferred between positions 90/91,
91/92, 92/93 and/or 98/99 of hIL-5 (SEQ ID NO: 133), or between
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), asthma, chronic obstructive pulmonary disease,
eosinophilia, allergic inflammatory diseases or other diseases
connected with hIL-5.

15. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin-6 (hIL-6) or related molecules of the same structural
class, preferably the peptide bonds between positions 32/33, 35/36,
55/56, 71/72, 129/130, 130/131, 132/133, 135/136, 141/142,
161/162, 180/181 and/or 183/184, most preferred between
positions 135/136 and/or 141/142 of hIL-6 (SEQ ID NO:134), or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), breast cancer, renal cell carcinoma, multiple
myeloma, lymphoma, leukemia, Graves' disease, Hashimoto's
thyroiditis, Sjogren's syndrome, systemic lupus erythematosus,
Systemic inflammatory response syndrome (SIRS) which leads to
distant organ damage and multiple organ dysfunction syndrome
(MODS), chronic obstructive pulmonary disease (COPD),
Castleman's diseases, inflammatory bowel diseases, Crohn's disease
or other diseases connected with hIL-6.

16. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin 8 (hIL-8) or related molecules of the same structural
class, preferably the peptide bonds between positions 11/12, 15/16,
45/46, 47/48, 52/53, 54/55, 60/61, 64/65 and/or 67/68, most
preferred between positions 45/46 of hIL-8 (SEQ ID NO: 100), or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of Crohn's disease,
Ulcerative colitis, classical Hodgkin's Lymphoma (cHL), Systemic
inflammatory response syndrome (SIRS) which leads to distant
organ damage and multiple organ dysfunction syndrome (MODS),
chronic obstructive pulmonary disease (COPD), endometriosis,
psoriasis, atherosclerotic lesions or other diseases connected with
hIL-8.

17. The use according to claim 9, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin-10 (hIL-10) or related molecules of the same structural
class, preferably the peptide bonds between positions 24/25, 25/26,
27/28, 28/29, 40/41, 44/45, 49/50, 57/58, 59/60, 84/85, 86/87,
106/107, 107/108, 110/111, 130/131, 134/135, 137/138, 138/139
and/or 144/145, most preferred between positions 24/25, 27/28,
44/45, 49/50, 86/87, 137/138 and/or 144/145 of hIL-10 (SEQ ID
NO: 135), or between analogous positions in related molecules;
and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), diseases related to the suppression of cytotoxic
T-cells or other diseases connected with hIL-10.

18. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin 12 beta chain (hIL-12β) or related molecules of the
same structural class, preferably the peptide bonds between
positions 14/15, 18/19, 29/30, 34/35, 87/88, 99/100, 102/103,
104/105, 161/162, 174/175, 222/223, 225/226, 228/229, 238/239,
268/269 and/or 293/294, most preferred between positions 18/19,
34/35, 87/88 and/or 161/162 of hIL-12β (SEQ ID NO:97) or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of Crohn's disease,
classical Hodgkin's Lymphoma (cHL) or other diseases connected
with hIL-12β.



19. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin 13 (hIL-13) or related molecules of the same structural
class, preferably the peptide bonds between positions 25/26, 62/63,
65/66, 86/87, 87/88, 98/99, 108/109 and/or 111/112, most
preferred between positions 87/88 of hIL-13 (SEQ ID NO: 119), or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of cancer, classical
Hodgkin's Lymphoma (cHL), eosinophilia, asthma, chronic
obstructive pulmonary disease, fibrosis, psoriasis, atopic dermatitis,
Ulcerative colitis or other diseases connected with hIL-13.

20. The use according to claim 9, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interleukin 18 (hIL-18) or related molecules of the same structural
class, preferably the peptide bonds between positions 17/18, 32/33,
37/38, 39/40, 40/41, 53/54, 58/59, 79/80, 90/91, 93/94, 98/99,
110/111, 120/121, 123/124, 131/132, 132/133, 142/143, 147/148
and/or 157/158, most preferred between positions 37/38, 132/133,
142/143 and/or 157/158 of hIL-18 (SEQ ID NO:98) or between
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of Crohn's disease,
inflammatory liver injuries, pulmonary tuberculosis, pleural
tuberculosis, rheumatoid arthritis or other diseases connected with
hIL-18.

21. The use according to claim 1 or 2, wherein the target substrate hydrolyzed 
by the protease is an interferon.

22. The use according to claim 21, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Interferon-gamma (hIFN-γ) or related molecules of the same
structural class, preferably the peptide bonds between positions
2/3, 6/7, 13/14, 21/22, 24/25, 34/35, 36/37, 37/38, 62/63, 68/69,
83/84, 86/87, 90/91, 102/103, 107/108 and/or 108/109, most
preferred between positions 13/14, 24/25, 37/38, 62/63, 68/69,
102/103 and/or 107/108 of hIFN-γ (SEQ ID NO:137), or between
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of classical Hodgkin's
Lymphoma (cHL), Crohn's disease, type I diabetes or other diseases
connected with hIFN-γ.

23. The use according to claim 1 or 2, wherein the target substrate hydrolyzed
by the protease is a chemokine.

24. The use according to claim 23, wherein

(i) the protease is capable of hydrolysing peptide bonds in human small 
inducible cytokine A2 (hCCL2) or related molecules of the same 
structural class, preferably the peptide bonds between positions 

3/4, 13/14, 18/19, 19/20, 24/25, 29/30, 38/39, 54/55, 56/57, 
58/59, 62/63, 65/66 and/or 68/69, most preferred between 
positions 19/20, 29/30, 38/39, 54/55 and/or 62/63 of hCCL2 (SEQ 
ID NO: 102), or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of Crohn's disease, 
Ulcerative colitis, or other diseases connected with hCCL2. 

25. The use according to claim 23, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Eotaxin (hCCL11) or related molecules of the same structural class,
preferably the peptide bonds between positions 11/12, 16/17,
17/18, 22/23, 27/28, 33/34, 44/45, 47/48, 48/49, 52/53, 54/55,
56/57, 60/61, 66/67, and/or 73/74, most preferred between
positions 48/49 and/or 66/67 of hCCL11 (SEQ ID NO: 101), or
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of Crohn's
disease and Ulcerative colitis, classical Hodgkin's Lymphoma (cHL),
chronic pathophysiologic dysfunction, characterized by an influx
mainly of Th2 cells, eosinophilia or other diseases connected with
hCCL11.

26. The use according to claim 1 or 2, wherein the target substrate hydrolyzed 
by the protease is a growth factor.

27. The use according to claim 26, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 

Vascular endothelial growth factor (hVEGF) or related molecules of
the same structural class, preferably the peptide bonds between
positions 16/17, 19/20, 23/24, 34/35, 41/42, 56/57, 62/63, 63/64,
64/65, 65/66, 82/83, and/or 84/85, most preferred between
positions 23/24, 41/42, 63/64, 82/83 and/or 84/85 of hVEGF (SEQ 
ID NO: 103), or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of all solid tumors and 

metastatic solid tumors, inflammatory breast cancer or other 
diseases connected with hVEGF.

28. The use according to claim 26, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Transforming growth factor beta 1 (hTGF-β1) or related molecules
of the same structural class, preferably the peptide bonds between 
positions 23/24, 25/26, 26/27, 27/28, 37/38, 55/56 and/or 94/95, 
most preferred between positions 25/26, 55/56 and/or 94/95 of 
hTGF-β1 (SEQ ID NO: 104), or between analogous positions in
related molecules; and/or

(ii) the medicament is suitable for the treatment of a variety of cancers 
including breast cancer, colorectal cancer and classical Hodgkin's 
Lymphoma (cHL); fibrosis, suppression of cell-mediated immunity, 
glaucoma, diffuse systemic sclerosis or other diseases connected 
with hTGF-β1.



29. The use according to claim 26, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 

Somatotropin (hGrowth hormone; hGH) or related molecules of the 
same structural class, preferably the peptide bonds between 
positions 8/9, 16/17, 19/20, 26/27, 33/34, 38/39, 41/42, 70/71, 
77/78, 94/95, 103/104, 112/113, 115/116, 116/117, 130/131, 
147/148, 154/155 and/or 178/179, most preferred between 
positions 112/113, 147/148 and/or 154/155 of hGH (SEQ ID
NO: 121), or between analogous positions in related molecules; 
and/or

(ii) the medicament is suitable for the treatment of acromegaly,
diabetes and diabetic kidney disease including renal hypertrophy
and glomerular enlargement, cardiovascular disorders or other 
diseases connected with hGH. 

30. The use according to claim 26, wherein 

(i) the protease is capable of hydrolysing peptide bonds in Insulin-like
growth factor II (hIGF-II) or related molecules of the same
structural class, preferably the peptide bonds between positions 
15/16, 23/24, 24/25, 34/35, 37/38, 38/39, 48/49 and/or 49/50, 
most preferred between positions 23/24 of hIGF-II (SEQ ID
NO: 122), or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of diabetes, diabetic 
kidney disease or other diseases connected with hIGF-II.

31. The use according to claim 26, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Hepatocyte growth factor (hHGF) or related molecules of the same 
structural class, preferably the peptide bonds between positions 
54/55, 60/61, 62/63, 63/64, 68/69, 76/77, 112/113, 123/124, 
134/135, 168/169, 198/199 and/or 202/203, most preferred
between positions 63/64, 68/69, 76/77, 168/169 and/or 202/203 of 
hHGF (SEQ ID NO: 120), or between analogous positions in related 
molecules; and/or 

(ii) the medicament is suitable for the treatment of angiogenic 
disorders, hepatocellular carcinoma or other diseases connected 
with hHGF. 

32. The use according to claim 1 or 2, wherein the target substrate hydrolyzed
by the protease is a soluble hormone.

33. The use according to claim 32, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 

Insulin (hInsulin) or related molecules of the same structural class,
preferably the peptide bonds between positions 16/17 and/or 22/23
of hInsulin B chain (SEQ ID NO: 105), and/or between positions
14/15 of Insulin A chain (SEQ ID NO: 106) or between analogous 
positions in related molecules; and/or 
(ii) the medicament is suitable for the treatment of insulin overdosage
or other diseases connected with hInsulin.

34. The use according to claim 32, wherein 

(i) the engineered protease is capable of hydrolysing peptide bonds in 
human Ghrelin (Ghrelin) or related molecules of the same structural 
class, preferably the peptide bonds between positions 1/2, 2/3, 3/4 
and/or 4/5 of hGhrelin (SEQ ID NO: 107), or between analogous 
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of obesity or other 
diseases connected with hGhrelin. 

35. The use according to claim 32, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
angiotensinogen (angiotensin) or related molecules of the same
structural class, preferably the peptide bonds between positions 
1/2, 3/4, and/or 7/8, most preferred between positions 3/4 of 
angiotensin (SEQ ID NO: 108), or between analogous positions in
related molecules; and/or 

(ii) the medicament is suitable for the treatment of essential 

hypertension and other diseases connected with angiotensin. 

36. The use according to claim 32, wherein 

(i) the engineered protease is capable of hydrolysing peptide bonds in 

human leptin (leptin) or related molecules of the same structural 
class, preferably the peptide bonds between positions 8/9, 9/10, 

15/16, 23/24, 40/41, 53/54, 71/72, 85/86, 94/95, 108/109 and/or 

141/142, most preferred between positions 9/10, 40/41, 71/72, 
94/95 and/or 108/109 of leptin (SEQ ID NO: 127), or between
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of obesity or other
diseases connected with leptin.

37. The use according to claim 1 or 2, wherein the target substrate hydrolyzed 
by the protease is a binary toxin. 

38. The use according to claim 37, wherein 

(i) the engineered protease is capable of hydrolysing peptide bonds in
Protective antigen precursor (PA-83) or related molecules of the 
same structural class, preferably the peptide bonds between 
positions 72/73, 73/74, 92/93, 93/94, 131/132, 149/150, 178/179, 
213/214, 214/215, 387/388, 425/426, 426/427, 427/428, 453/454, 
520/521, 608/609, 617/618, 671/672, 679/680, 680/681, 683/684 
and/or 684/685, most preferred between positions 72/73, 73/74, 
93/94, 149/150, 387/388, 425/426, 427/428 and/or 683/684 of 
hPA-83 (SEQ ID NO: 123), or between analogous positions in related 
molecules; and/or 

(ii) the medicament is suitable for the treatment of anthrax infection or 
other diseases connected with PA-83. 

39. The use according to claim 1 or 2, wherein the target substrate hydrolyzed 
by the protease is a protease, in particular a serine, cysteine or metallo protease. 

40. The use according to claim 39, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
plasminogen (plasminogen) or related molecules of the same 
structural class, preferably the peptide bonds between position 
580/581 of plasminogen (SEQ ID NO:140), or between analogous 
positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of thrombosis or other 
diseases connected with plasminogen.

41. The use according to claim 39, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Prothrombin (thrombin) or related molecules of the same structural
class, preferably the peptide bonds between positions 198/199,
327/328, 363/364, most preferred between positions 327/328 
and/or 363/364 of thrombin (SEQ ID NO: 149), or between 
analogous positions in related molecules; and/or 
(ii) the medicament is suitable for the treatment of bleeding or other 

diseases connected with thrombin. 

42. The use according to claim 39, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human beta- 
secretase or related molecules of the same structural class, 
preferably the peptide bonds between positions 61/62, 64/65, 
130/131, 131/132, 159/160, 216/217, 238/239, 239/240, 246/247, 
256/257, 259/260, 330/331, 365/366, 378/379 and/or 381/382, 
most preferred between positions 61/62, 131/132, 246/247, 
259/260, 365/366 and/or 378/379 of beta-secretase (SEQ ID
NO: 139), or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of Alzheimer's disease or other
diseases connected with beta-secretase precursor. 

43. The use according to claim 39, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
matrix metalloproteinase-2 (hMMP-2) or related molecules of the
same structural class, preferably the peptide bonds between 
positions 62/63, 68/69, 75/76, 76/77, 79/80, 88/89, 110/111, 
112/113, 115/116, 120/121, 164/165, 254/255, 267/268, 296/297, 
324/325, 325/326, 382/383, 383/384, 470/471, 500/501, 550/551, 
564/565, 595/596, 597/598, 608/609, 646/647, 649/650 and/or 
650/651, most preferred between positions 68/69, 115/116, 
120/121, 164/165, 325/326, 383/384, 470/471, 500/501, 595/596, 
608/609 and/or 650/651 of hMMP-2 (SEQ ID NO:131), or between 
analogous positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of a variety of cancers 

including bladder cancer, breast tumor cancer, gastric cancer, lung 
cancer or other diseases connected with hMMP-2.

44. The use according to claim 39, wherein

(i) the protease is capable of hydrolysing peptide bonds in human 
matrix metalloproteinase-9 (hMMP-9) or related molecules of the
same structural class, preferably the peptide bonds between 
positions 41/42, 42/43, 106/107, 113/114, 134/135, 160/161, 
162/163, 163/164, 222/223, 226/227, 265/266, 266/267, 267/268, 
284/285, 309/310, 321/322, 322/323, 324/325, 356/357, 380/381, 
433/434 and/or 440/441, most preferred between positions 
160/161, 163/164, 226/227, 284/285, 321/322, 322/323 and/or 
433/434 of hMMP-9 (SEQ ID NO: 132), or between analogous 
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of a variety of cancers 
including bladder cancer, breast tumor cancer, gastric cancer, lung 
cancer or other diseases connected with hMMP-9. 

45. The use according to claim 1 or 2, wherein the target substrate hydrolyzed 
by the protease is a type-1 single pass transmembrane protein. 

46. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in HIV
membrane glycoprotein (GP120) or related molecules of the same
structural class, preferably the peptide bonds between positions 
97/98, 99/100, 107/108, 113/114, 117/118, 227/228, 231/233, 
279/280, 335/336, 337/338, 368/369, 412/413, 419/420, 429/430, 
444/445, 457/458, 474/475, 476/477, 477/478, 485/486 and/or 
490/491, most preferred between positions 99/100, 368/369, 
412/413, 419/420, 444/445 and/or 490/491 of GP120 (SEQ ID
NO: 124) or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of AIDS or HIV
infection or other diseases connected with GP120.

47. The use according to claim 45, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Cytotoxic T-lymphocyte protein 4 (CTLA-4) or related molecules of
the same structural class, preferably the peptide bonds between 
positions 14/15, 28/29, 33/34, 38/39, 41/42, 62/63, 72/73, 85/86, 
95/96, 100/101, 105/106, 119/120, 125/126 and/or 127/128, most 
preferred between positions 14/15, 28/29, 38/39, 41/42, 62/63 
and/or 85/86 of CTLA-4 (SEQ ID NO: 144), or between analogous 
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of breast cancer or 
other diseases connected with CTLA-4. 



48. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Integrin alpha-2 (hVLA-2) or related molecules of the same 
structural class, preferably the peptide bonds between positions 

160/161, 174/175, 201/202, 219/220, 231/232, 232/233, 233/234, 
243/244, 259/260, 264/265, 268/269, 288/289, 292/293, 294/295, 
298/299, 301/302, 310/311 and/or 317/318, most preferred 
between positions 160/161, 174/175, 201/202, 219/220, 243/244, 
264/265, 292/293 and/or 294/295 of hVLA-2 (SEQ ID NO: 147), or
between analogous positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of renal tumors, uveal 
melanomas, gastrointestinal tumors or other diseases connected 
with hVLA-2. 

49. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 

Vascular endothelial growth factor receptor 1 (hVEGFR 1) or related 
molecules of the same structural class, preferably the peptide bonds 
between positions 175/176, 180/181, 187/188, 189/190, 190/191, 
224/225 and/or 331/332, most preferred between positions 
189/190 and/or 331/332 of hVEGFR 1 (SEQ ID NO: 114), or 
between analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of solid tumors and
metastatic solid tumors, astrocytic brain tumors, pancreatic cancer,
metastatic renal cancer or other diseases connected with hVEGFR 1.

50. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Vascular endothelial growth factor receptor 2 (hVEGFR 2) or related 
molecules of the same structural class, preferably the peptide bonds 
between positions 214/215, and/or 323/324, most preferred 
between positions 214/215 of hVEGFR 2 (SEQ ID NO:115), or 
between analogous positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of solid tumors and 
metastatic solid tumors, pancreatic cancer, metastatic renal cancer,
metastatic CRC, or other diseases connected with hVEGFR 2. 

51. The use according to claim 45, wherein

(i) the protease is capable of hydrolysing peptide bonds in human 
Epidermal growth factor receptor (hEGFr) or related molecules of 
the same structural class, preferably the peptide bonds between 
positions 20/21, 29/30, 48/49, 74/75, 155/156, 165/166, 167/168, 
202/203, 206/207, 220/221, 223/224, 246/247, 251/252, 254/255,
269/270, 270/271, 297/298, 304/305, 305/306, 357/358, 364/365, 
369/370, 430/431, 443/444, 454/455, 455/456, 463/464, 465/466, 
476/477, 507/508 and/or 509/510, most preferred between
positions 155/156, 246/247, 251/252, 254/255, 269/270, 270/271, 
297/298, 304/305, 306/307, 364/365 and/or 454/455 of hEGFr 
(SEQ ID NO: 116), or between analogous positions in related
molecules; and/or 

(ii) the medicament is suitable for the treatment of bladder cancer, 
breast cancer, cervical cancer, colorectal cancer, endometrial 
cancer, oesophageal cancer, head and neck cancer, gastric cancer, 
non-small-cell lung carcinoma, ovarian cancer or other diseases 
connected with hEGFr. 

52. The use according to claim 45, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Epithelial cell adhesion molecule (Ep-CAM) or related molecules of
the same structural class, preferably the peptide bonds between 
positions 14/15, 19/20, 25/26, 29/30, 30/31, 33/34, 44/45, 55/56, 
67/68, 70/71, 90/91 and/or 100/101, most preferred between 
positions 14/15, 30/31, 44/45, 70/71 and/or 100/101 of Ep-CAM 
(SEQ ID NO: 125) or between analogous positions in related 
molecules; and/or 

(ii) the medicament is suitable for the treatment of colorectal cancer or 

other diseases connected with Ep-CAM. 

53. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Insulin-like growth factor I receptor (hIGF-1r) or related molecules
of the same structural class, preferably the peptide bonds between 
positions 39/40, 59/60, 115/116, 132/133, 146/147, 171/172, 
191/192, 250/251, 262/263, 270/271, 290/291, 306/307, 307/308, 
335/336, 336/337, 403/404, 405/406, 455/456 and/or 470/471, 
most preferred between positions 39/40, 262/263, 306/307, 
307/308, 335/336, 405/406 and/or 470/471 of hIGF-1r (SEQ ID
NO: 126) or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of a variety of cancers
including breast cancer or other diseases connected with hIGF-1r.

54. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human T- 

cell surface antigen CD2 (hCD2) or related molecules of the same 
structural class, preferably the peptide bonds between positions 

16/17, 20/21, 28/29, 29/30, 40/41, 42/43, 43/44, 48/49, 49/50, 
51/52, 54/55, 58/59, 63/64, 69/70, 76/77, 89/90 and/or 91/92, 
most preferred between positions 28/29, 40/41, 43/44, 51/52, 
76/77 and/or 89/90 of hCD2 (SEQ ID NO: 128) or between 
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of psoriasis or other
diseases connected with hCD2.

55. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human T 
cell surface glycoprotein CD4 (hCD4) or related molecules of the 
same structural class, preferably the peptide bonds between 
positions 53/54, 63/64, 88/89, 166/167, 167/168, 173/174, 
206/207, 219/220, 224/225, 226/227, 230/231, 244/245, 251/252, 
252/253, 322/323, 329/330 and/or 334/335, most preferred 
between positions 88/89, 173/174, 206/207, 219/220, 230/231, 
251/252 and/or 252/253 of hCD4 (SEQ ID NO: 129), or between
analogous positions in related molecules; 

(ii) preferably the medicament is suitable for the treatment of psoriasis, 
transplant rejection, graft-versus-host colitis, autoimmune 
disorders, rheumatoid arthritis or other diseases connected with 
hCD4. 

56. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Integrin alpha-L (hCD11a) or related molecules of the same
structural class, preferably the peptide bonds between positions 

145/146, 152/153, 156/157, 159/160, 160/161, 177/178, 178/179, 
189/190, 190/191, 191/192, 193/194, 197/198, 200/201, 221/222, 
229/230, 249/250, 253/254, 268/269, 290/291, 297/298, 304/305 
and/or 305/306, most preferred between positions 145/146, 
159/160, 160/161, 189/190, 229/230, 249/250, 268/269, 297/298, 
304/305 and/or 305/306 of hCD11a (SEQ ID NO: 130), or between 
analogous positions in related molecules; and/or

(ii) the medicament is suitable for the treatment of psoriasis or other 
diseases connected with hCD11a.

57. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Interferon-gamma receptor alpha chain (hIFN-γ-R1) or related
molecules of the same structural class, preferably the peptide bonds
between positions 49/50, 52/53, 58/59, 62/63, 72/73, 76/77, 
106/107, 107/108, 116/117, 122/123, 174/175, 176/177, 179/180, 
215/216 and/or 222/223, most preferred between positions 49/50, 
72/73, 116/117, 122/123, 174/175, 176/177 and/or 215/216 of
hIFN-γ-R1 (SEQ ID NO: 136), or between analogous positions in
related molecules; and/or 
(ii) the medicament is suitable for the treatment of classical Hodgkin's 

Lymphoma (cHL), type I diabetes or other diseases connected with 
hIFN-γ-R1.

58. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Platelet membrane glycoprotein IIb/IIIa (GPIIb/IIIa) or related
molecules of the same structural class, preferably the peptide bonds 
between positions 67/68, 76/77, 91/92, 129/130, 143/144, 
144/145, 179/180, 181/182, 198/199, 208/209, 209/210, 216/217, 
239/240, 261/262, 410/411, 434/435, 532/533, 556/557, 557/558, 
596/597, 597/598, 621/622, 650/651, 651/652, 661/662, 662/663 
and/or 689/690, most preferred between positions 67/68, 76/77, 
179/180, 261/262, 410/411, 434/435, 650/651, 662/663 and/or 
689/690 of GPIIb/IIIa (SEQ ID NO:141), or between analogous
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of unstable angina, 
carotid stenting, ischemic stroke, peripheral vascular diseases, 
angiogenesis-related diseases, disseminating tumors or other
diseases connected with GPIIb/IIIa.

59. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
Intercellular adhesion molecule-1 (ICAM-1) or related molecules of
the same structural class, preferably the peptide bonds between 
positions 26/27, 40/41, 60/61, 71/72, 88/89, 97/98, 102/103, 
128/129, 131/132, 132/133, 149/150, 150/151, 151/152, 160/161, 
164/165 and/or 166/167, most preferred between positions 71/72,
88/89, 102/103, 150/151, 151/152, 160/161 and/or 166/167 of
ICAM-1 (SEQ ID NO: 142), or between analogous positions in related 
molecules; and/or 

(ii) the medicament is suitable for the treatment of Crohn's disease or 
other diseases connected with ICAM-1.

60. The use according to claim 45, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human
TGF-beta receptor type II (hTGF-β RII) or related molecules of the same
structural class, preferably the peptide bonds between positions 
32/33, 34/35, 35/36, 66/67, 67/68, 69/70, 82/83, 103/104, 
104/105, 105/106, 118/119, 122/123 and/or 130/131, most 
preferred between positions 32/33, 34/35, 66/67, 69/70, 104/105, 
122/123 and/or 130/131 of hTGF-β RII (SEQ ID NO:145), or
between analogous positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of diffuse systemic 
sclerosis or other diseases connected with hTGF-β RII.

61. The use according to claim 45, wherein

(i) the protease is capable of hydrolysing peptide bonds in human 
Membrane cofactor protein (hMCP) or related molecules of the same
structural class, preferably the peptide bonds between positions 
15/16, 17/18, 25/26, 27/28, 31/32, 32/33, 35/36, 47/48, 48/49, 
58/59, 67/68, 69/70, 70/71, 110/111, 119/120, 125/126 and/or 
130/131, most preferred between positions 15/16, 32/33, 47/48, 
48/49, 70/71, 119/120 and/or 125/126 of hMCP (SEQ ID NO:146),
or between analogous positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of renal tumors, uveal 
melanomas, gastrointestinal tumors or other diseases connected 
with hMCP. 

62. The use according to claim 1 or 2, wherein the target substrate hydrolyzed 
by the protease is a protease-activated receptor or hydroxytryptamine receptor. 

63. The use according to claim 62, wherein

(i) the protease is capable of hydrolysing peptide bonds in human
Protease activated receptor 1 (hPAR1) or related molecules of the 
same structural class, preferably the peptide bonds between 
positions 46/47, 50/51, 51/52, 52/53 and/or 58/59 of hPAR1 (SEQ 
ID NO: 110), or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of thrombosis or other 
diseases connected with hPAR1.

64. The use according to claim 62, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Protease activated receptor 2 (hPAR2) or related molecules of the 
same structural class, preferably the peptide bonds between 
positions 41/42, 44/45, 51/52, 59/60 and/or 62/63 of hPAR2 (SEQ 
ID NO:111), or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of Crohn's disease, 
Ulcerative colitis and Inflammatory bowel disease, asthma, 
inflammation associated pain and arthritis or other diseases 
connected with hPAR2. 

65. The use according to claim 62, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
Protease activated receptor 4 (hPAR4) or related molecules of the 
same structural class, preferably the peptide bonds between 
positions 57/58, 59/60, 68/69, 74/75 and/or 78/79 of hPAR4 (SEQ
ID NO:113), or between analogous positions in related molecules; 
and/or 

(ii) the medicament is suitable for the treatment of thrombosis or other 
diseases connected with hPAR4. 

66. The use according to claim 62, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 5- 

hydroxytryptamine 1A receptor (h5-HT-1A) or related molecules of 
the same structural class, preferably the peptide bonds between
positions 101/102, 102/103, 181/182 and/or 370/371 of h5-HT-1A
(SEQ ID NO: 117) or between analogous positions in related
molecules; and/or 

(ii) the medicament is suitable for the treatment of irritable bowel 

syndrome or other diseases connected with h5-HT-1A. 

67. The use according to claim 1 or 2, wherein 

(i) the protease is capable of hydrolysing peptide bonds in human 
carcinoembryonic antigen (hCEA) or related molecules of the same 
structural class, preferably the peptide bonds between positions 
17/18, 69/70, 71/72, 74/75, 77/78, 98/99, 116/117, 126/127 
and/or 128/129 of hCEA (SEQ ID NO: 138), or between analogous
positions in related molecules; and/or 

(ii) the medicament is suitable for the treatment of colon cancer or 
other diseases connected with hCEA. 

68. The use according to any one of claims 1 to 67, wherein the protease is an 
engineered protease, preferably an engineered protease characterized by a
combination of the following components: 

(a) a protein scaffold capable of catalyzing at least one chemical reaction on at
least one target substrate and being derived from one or more proteins, and 

(b) one or more specificity determining regions (SDRs) located at sites in the 
protein scaffold that enable the resulting engineered protein to discriminate 
between at least one target substrate and one or more different substrates, and 
wherein the SDRs are essentially synthetic peptide sequences. 

69. The use according to claim 68, wherein 

(I) the SDRs (b) have a length between one and 50 amino acid residues,
preferably have a length between 2 and 20 amino acid residues, more preferably 
a length between 2 and 10 amino acid residues, even more preferably a length 
between 3 and 8 amino acid residues, and wherein the number of SDRs is at 
least one, preferably more than one, more preferably between two and eleven, 
most preferably between two and six; and/or 

(II) the protein scaffold (a) is comprised of one or more polypeptide segments 
being derived from the same or different

(i) proteins encoded by a gene of viral or prokaryotic or eukaryotic origin, and/or

(ii) native enzymes, mutated variants or truncated derivatives thereof, and/or

(iii) mammalian enzymes, preferably human enzymes. 

70. The use according to claim 68 or 69, wherein the protein scaffold (a) is 
derived from a protease selected from the group consisting of aspartic, cysteine, 
serine, metallo and threonine proteases, 

even more preferably the protein scaffold (a) is derived from a serine protease of 
the structural class S1, S8, S11, S21, S26, S33 or S51, most preferably from
class S1 or S8, a cysteine protease of the structural class C1, C2, C4, C10, C14,
C19, C47, C48 or C56, most preferably from class C14, or an aspartic protease of
the structural class A1, A2 or A26, most preferably from class A1, or a
metalloprotease of the structural class M4 or M10.

71. The use according to any one of claims 68 to 70, wherein

(i) the protein scaffold (a) is derived from a serine protease of the 
structural class S1; and/or

(ii) the SDRs are located at one or more positions selected from the 
group of positions that correspond structurally or by amino acid 
sequence homology to the regions 18-25, 38-48, 54-63, 73-86, 
122-130, 148-156, 165-171 and 194-204 in human trypsin I having 
the amino acid sequence shown in SEQ ID NO:1, and preferably at 
one or more positions selected from the group of positions that 
correspond structurally or by amino acid sequence homology to the 

regions 20-23, 41-45, 57-60, 76-83, 125-128, 150-153, 167-169 
and 197-201 in human trypsin I. 

72. The use according to claim 71, wherein

(i) the protein scaffold (a) is derived from the serine protease trypsin, 
preferably human trypsin I having the amino acid sequence shown 
in SEQ ID NO:1, or the amino acid sequence SEQ ID NO:1
comprising one or more of the following amino acid substitutions
E56G, R78W, Y131F, A146T and C183R; and

(ii) at least one of two SDRs are located in the scaffold, a first SDR
having a length of up to 6 amino acids and being inserted between
residues 42 and 43, and a second SDR having a length of up to 5
amino acids and being inserted between residues 123 and 124
(numbering relative to human trypsin having the amino acid 
sequence shown in SEQ ID NO:1). 

73. The use according to claim 72, wherein 

(i) one of the peptide sequences of the following group: SEQ ID NO: 
72, 78, 79/80, 84, 85, 86, 87, 88, and 89 is inserted as the first 
SDR between residues 42 and 43; and/or one of the peptide 
sequences of the following group: SEQ ID NO: 73, 81, 82, 83, 90, 
91, 92, 93, 94, and 95 is inserted as the second SDR between
residues 123 and 124; or 

(ii) the engineered enzyme comprises an amino acid sequence as 
shown in SEQ ID NO: 74, or SEQ ID NO: 75. 

74. The use according to any one of claims 68 to 70, wherein 

(i) the protein scaffold (a) is derived from a serine protease of the 
structural class S8; and/or

(ii) the SDRs are located at one or more positions selected from the
group of positions that correspond structurally or by amino acid 
sequence homology to the regions 6-17, 25-29, 47-55, 59-69, 101- 
111, 117-125, 129-137, 139-154, 158-169, 185-195 and 204-225 
in subtilisin E from Bacillus subtilis having the amino acid sequence shown in
SEQ ID NO: 7, and preferably at one or more positions selected from 
the group of positions that correspond structurally or by amino acid 
sequence homology to the regions 59-69, 101-111, 129-137,
158-169 and 204-225 in subtilisin E from Bacillus subtilis.

75. The use according to any one of claims 68 to 70, wherein 

(i) the protein scaffold (a) is derived from an aspartic protease of the 
structural class A1; and/or

(ii) the SDRs are located at one or more positions selected from the 
group of positions that correspond structurally or by amino acid 
sequence homology to the regions 6-18, 49-55, 74-83, 91-97, 112- 
120, 126-137, 159-164, 184-194, 242-247, 262-267 and 277-300
in human pepsin having the amino acid sequence shown in SEQ ID
NO:11, and more preferably at one or more positions selected from 
the group of positions that correspond structurally or by amino acid 
sequence homology to the regions 10-15, 75-80, 114-118, 130- 
134, 186-191 and 280-296 in human pepsin. 

76. The use according to any one of claims 68 to 70, wherein 

(i) the protein scaffold (a) is derived from a cysteine protease of the 
structural class C14; and/or 

(ii) the SDRs are located at one or more positions selected from the 
group of positions that correspond structurally or by amino acid 
sequence homology to the regions 78-91, 144-160, 186-198, 226- 
243 and 271-291 in human caspase 7 having the amino acid 
sequence of SEQ ID NO: 14, and preferably at one or more positions 
selected from the group of positions that correspond structurally or 
by amino acid sequence homology to the regions 80-86, 149-157, 
190-194 and 233-238 of human caspase 7.

77. The use according to any one of claims 1 to 76, wherein the protease 
comprises 

(i) at least one further proteinaceous component, preferably being selected from
the group consisting of binding domains, receptors, antibodies, regulation 
domains, pro-sequences, and fragments thereof, and/or 

(ii) at least one further functional component, preferably being selected from the 
group consisting of polyethylene glycols, carbohydrates, lipids, fatty acids, nucleic
acids, metals, metal chelates, and fragments or derivatives thereof. 

78. The use according to any one of claims 1 to 77, wherein the protease is
obtainable by a method comprising at least the following steps: 

(a) providing a protein scaffold which catalyzes at least one chemical reaction on 
at least one target substrate, 

(b) generating a library of enzymes or isolated enzymes by combining the 
protein scaffold from step (a) with variants of one or more fully or partially 
random peptide sequences at sites in the protein scaffold that enable the 
resulting enzyme to discriminate between at least one target substrate and one
or more different substrates, and 

(c) selecting out of the (library of) enzymes generated in step (b) one or more 
enzymes that have defined specificities towards at least one target substrate. 

79. Use of an enzyme according to any one of claims 1 to 78 for in vivo or in 
vitro diagnostic purposes. 

80. A pharmaceutical or diagnostic composition comprising one or more enzymes 
according to any one of claims 1 to 78, said pharmaceutical or diagnostic 
composition optionally comprising pharmaceutically or diagnostically acceptable
carrier(s), excipient(s) and/or auxiliary agent(s). 

81. A method for cleaving a target substrate as defined in claims 1 to 67 in vivo 
or in vitro, which comprises contacting the target substrate with a protease as 
defined in claims 1 to 78. 

82. A method for treatment of a disease in a patient connected with a specific
target substrate as defined in any one of claims 1 to 67, which comprises
administering to the patient a suitable amount of a protease with defined specificity
for said specific target substrate as defined in any one of claims 1 to 78.
Trypsin IVGGYNCEKNSVPYQVSI, NSGYHF-CGGSIiINEQWWSAGHCY

a-Thrombin IVEGSDAEIGMSPWQVMLFRKSPQELL-CGASLISDRWVLTAAHCLLYPP 

Enteropeptidase IVGGSKAKEGAWPWWGL YYGGRIjLCGASLVSSDWLVSAABCVYGRN

** * * * * ** ** * * ** 



Trypsin KSRIQVRLGEH NIEVIiEGNHEQFINAAKIIRHPQYD-RKTL 

a-Thrombin WDKNFTENDLLVRI GKH SRTRYERNIEKISMLEKI YI HP RYNWRENL 

Enteropeptidase LE PSKWTAIX.GLBMKSNLTSPQTV— PRLID — EI VINP HYN-RRRK

-1 * * * * * * 



Trypsin NNDIMLIKLSSRAVINARVSTISUPTA PPAT GTKCLISGWG 

a-Thrombin DRDIALMKLKKPVAFSDYIHPVCLPDR ETAAS LLQAGY KGRVTGWG 

Enteropeptidase ENDIAMMHLEFKVNYTDYIQPICLPEENQVFPP GRNCSIAGWG

** * ** 2 * *** 



Trypsin N TASSGADYPDELQCIiDAPVLSQAKCEASYPG-KITSNMFCVGFL 

a-Thrombin NLKETWTANVGKGQP5 VIiQWNLP I VERPVCKDSTRI -RI TDNMFCAGYK 

Enteropeptidase T WYQGTT-ANILQEADVPLLSNERCQQQMPEYNITENMICAGYE 

— 3 — * ** * * ** ** * * 

Trypsin — EGGK — DSCQGDSGGPWCNGQ LQ GWSWGDGCAQKNKP 

a-Thrombin PDEGKRGD ACEGD SGGPFVMKS P FNNRWY QMG I VS WGEGCDRDG KY 

Enteropeptidase -EGGI — DSCQGDSGGPLMCQENNRWFLA GVTSFGYKCALPNRP 

* * * ****** ^ * * * * 

Trypsin GVYTKVYNYVKWI KNT I AANS - 

a-Thrombin GF YTHVFRLKKWI QKVI DQFGE 

Enteropeptidase GV YARVS RF TEWI QSFLH 

* * * ** 
sub IAHEYAQSV — PY GISQ — I KAPALB SQGY

furin VAKRRAKRD — VYQEPTDPKFPQQWYI^GVTQRDLNVKEAWAQGF 

PC_SKl EKERSKRSALRDS ALNL — FNDPMWKQQWYI*QDTRMTAAI>PKLDL 

PC_SK5 NTHPCQ SD — MNIEGAWKRGY 

1 2— 



sub 
furin 
PC_SKl 
PC_SK5 



-TGSNVKVAVIDSGIDSSHPDL — NVRGGAS — FVPSETN- 
TGHGIWSILDDGIEKNHPDIAGNYDPGAS- -FDVNDQD- 



PDPQ 

H VI P VWQKG I TGKGWITVIjDDGLEVJNH TD I Y AN YD PEAS YDFNDNDHD P 

TGKNI WTI LiDDG IERTHPDX. MQNYDA- -LASCDVNGNDLDPMP 

* * * * 3 



sub 
furin 
PC_SK1 
PC SK5 



YQ DGSS HGTHVAGTIA All— NNS I GVLGVS P SAS L YAVKVLD S ■ 

PRYTQM NDNR HGTRCAGEVA AVANNGVCGVGVA YNARI GGVRMLD- 



FPRYDPTNENK HGTRCAGE I AMQAN— NHKCG V— GVA YN S K VGG I RMLDG 

"RY DAS NENKH GTRCAGEVA AAANNSHCTVGIAFNAKIGGVRMIiDGDVTD 



-GPT GSTA- 



-LKT 



sub -TGSGQYSWIINGIE-WAISNNMDVINMSLG 

furin GEVTDAVEARS-LGLNPNHIHIYSASW GPEDDGKTVDGPARXAEE 

PC_SK1 — IVTDAIEASSIGFN PGHVDIYSASWGPNDDGKTVEGP GRUV QKAFE 

PC_SK5 MVEAKS VSFNPQHVHI YS ASWGPDDDGKTVD GPA — P I,T RQ 

_ 5 6 7 g_ 



sub 
furin 

FC_SK1 
PC SK5 



— WDKAVSS G IWAAAAGNE GS S GS TS TVGYPAKYP S T I AVGAV 

— AFFRGVSQGRGGLGSIFVWASGNGGREHDSCNCDGYTNSI-YTLSISSATQFGNV 

YGVKQGRQGKG S I FVWASGNGGRQ GDNCDCD GYTDS I YTI SI 

— AFENG VRMGRRGIiG S VF VWAS G NGGRS K DHCS CDGY TN S I— YT I S ISS TAESGKKPWY 



sub 
furin 
PC _SK1 
PC_SK5 



N SSNQR — 

— P WYSEACS STLA- • 



ASFSSAG— SELDVMAPGVSIQSTLPGGTYGAY 

TT Y S S GN QNEKQ I VTTDLRQKCT E3H 

— S- SASQQGLSPWYAEKCS STLATS YSSG— DYTDQRI TSADLHNDCT ETH 

LEE CS STL ATT YSSG — ESYDKKI ITTDLRQRCTDNH 

10 * ±1 



sub NGTSMATPHVAGAAALIL — SKHP — TWTNAQVRDRLES TAT Y — LG-NSFYYGKGLINV 

f u rin TGTS ASAP LAAGI I ALTLE AN KN3j — TWRDMQHLWQTS KP AH — LN -ADD WATNG VGRK 

PC_SK1 TGTSASAPIAAGIFALAL — EANP — NLTWKDMQHLWWTSEYDPLA -NNPGWKKNGAGL 

PC_SKS TGTSASAPMAAGI IALAIr— EANPFLTWRDVQHVI VRTSRAGH — LNANDWKTNAAGFKV 



Fig. 4 






Peps . TLVDEQP iENYLDMEYFGTIGIGTPAQDFTWFDTGSSNLWVPSVYCSSL— ACTN 

Seer. EMVDN LRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNB'AVGAAPHPFL 

Cath . P AVTEGP I PEV1KN YMDAQY YGEI GIGTPP QCFTWFDTG S SNLWVP S I H CKLLDI ACW I 

* 1 * * * * ****** * . o 



Peps . HNRFNPEDS S T YQ STS ETVS I T YGTGSMTGI LGYDTVQV G GISDTN 

Seer . HRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSI PHGP NVTVRA 

Cath . HHKYNSDKSSXYVKKGTSFDIHYGSGSLSGYIiSQDTVSVPCQSASSASAIiG GVKVER 

- **** 3 * + * A 



Peps . QIFGLSETEPGSFLYYAPFDGILGLAYPSIS — S SGATP VFDN I WNQGLVS QDLFS V YLS 

Seer. NIAAITESDK-FFINGSNWEGILGIAYAEIARPDDSLEPFFDSLVKQTHVP-NIiFSLQLC 
Cath . QVFGEATKQPG I TFI AAKFD GI LGMAYPRI S — VNNVLPVFDNLMQQKIjVDQNIFSFYLS 

5 **** g ** * * ** * 



Peps . ADD KS — GSWIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGETI 

Seer. GAGFPLNQSEVLASV — GGSWIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDIi 

Cath . RDP D AQPGGELMLGGTDSKY YKGS L SYLJNVTRKA Y WQVHLDQVE VAS G LT 

7 ** * * **-* g 



Peps . A — CAEGC — QAIVDTGTSI>LTGPTSPIANIQSDIGASENSD GDMWSCSAI 

Seer . KMDCKEYNYDKSIVDSGTTNI^IPKKVFEAAVKSIKAASSTEPCFPDGFVTI^ 

Cath . L — CKEGC — EAIVDTGTSLMVGPVDEVRELQKAI GAVPLIQ GEYMI P CEKV 

* * *** ** * * * 5 * 



Peps . SSLPDIVFTI NGVQYPVPPSAYILQSEGS CISGFQGMNVP— TESG 

Seer . GTTPWNIFPVISLYLMGEVTNQSFRITILPQQYI.RPVEDV AT S QDDC YKFAI S QS S 

Cath . ST LP AI TLRL GGKGYKLSPEDYTLKVSQAGKTLCLSGFMGMDIP-PPSG 

* X0 * * X1 

Peps . ELWIUGDVFIRQYFTVFDRAWNQVGLAPVA 

Secx. TGTVMGAVI ME GF YWFDRARKRI GFAVS A 

Cath . PLWILGDVF I GRYYTVFDRDNNRVGFAE AA 

— * * ***★ * * * 



Fig. 6 
01 MLEADDQGCI EEQGVEDSAN

61 Q YNMNF EKLG KCII INNKNF 

121 MQDHLKKASE EDHTNAACFA 

181 KLFFIQACRG TEIiDDGIQAD 
-3 

241 SWFVQALCSI LEEHGKDLEI 
301 LYFSQ 
EDSVDAKPDR SSFVPSLFSK

DKVTGMGVRN GTDKDAEALF 
1 

CILIiSHGEEN VIYGKDGVTP 

2 

SGPINDTDAN PRYKIPVEAD 

MQILTRVNDR VARHFESQSD 



KKKNVTMRSI KTTRERVPTY 

KCFRS LGFDV IVYNDCSCAK 

I KDLT AHFRG DRSKTLLEKP 

FI»FAYSTVPG YYSWRSPGRG 
4 

DP HFHEKKQI PCWSMLTKE 
5 
substrate A substrate B
proteolytic digestion of TNF
proteolytic digestion of serum protein
SEQUENCE LISTING



<110> DIREVO Biotech AG 

<120> NEW BIOLOGICAL ENTITIES AND THE USE THEREOF

<130> 0414 81wo/JH/ml 

<160> 126 

<170> PatentIn version 3.1

<210> 1 

<211> 224 

<212> PRT 

<213> Homo sapiens 



<400> 1 



Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
            20                  25                  30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Lys Ser Arg Ile Gln Val
        35                  40                  45

Arg Leu Gly Glu His Asn Ile Glu Val Leu Glu Gly Asn Glu Gln Phe
    50                  55                  60

Ile Asn Ala Ala Lys Ile Ile Arg His Pro Gln Tyr Asp Arg Lys Thr
65                  70                  75                  80

Leu Asn Asn Asp Ile Met Leu Ile Lys Leu Ser Ser Arg Ala Val Ile
                85                  90                  95

Asn Ala Arg Val Ser Thr Ile Ser Leu Pro Thr Ala Pro Pro Ala Thr
            100                 105                 110

Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly Asn Thr Ala Ser Ser Gly
        115                 120                 125

Ala Asp Tyr Pro Asp Glu Leu Gln Cys Leu Asp Ala Pro Val Leu Ser
    130                 135                 140

Gln Ala Lys Cys Glu Ala Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met
145                 150                 155                 160

Phe Cys Val Gly Phe Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp
                165                 170                 175

Ser Gly Gly Pro Val Val Cys Asn Gly Gln Leu Gln Gly Val Val Ser
            180                 185                 190

Trp Gly Asp Gly Cys Ala Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys
        195                 200                 205

Val Tyr Asn Tyr Val Lys Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
    210                 215                 220










<210> 2

<211> 235

<212> PRT

<213> artificial sequence

<220>

<223> trypsin variant 1

<400> 2

Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
            20                  25                  30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Asp Ala Val Gly Arg Asp
        35                  40                  45

Lys Ser Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Glu Val Leu
    50                  55                  60

Glu Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro
65                  70                  75                  80

Gln Tyr Asp Arg Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys Leu
                85                  90                  95

Ser Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu Pro
            100                 105                 110

Thr Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly
        115                 120                 125

Asn Thr Ile Thr Asn Ser Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp
    130                 135                 140

Glu Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Ala Lys Cys Glu
145                 150                 155                 160

Ala Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe
                165                 170                 175

Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val
            180                 185                 190

Val Cys Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly Asp Gly Cys
        195                 200                 205

Ala Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val
    210                 215                 220

Lys Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
225                 230                 235


<210> 3 

<211> 235 

<212> PRT 

<213> artificial sequence 
<220> 

<223> trypsin variant 2 



<400> 3 




Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
            20                  25                  30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Asn Gly Arg Asp Leu Glu
        35                  40                  45

Lys Ser Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Glu Val Leu
    50                  55                  60

Glu Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro
65                  70                  75                  80

Gln Tyr Asp Arg Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys Leu
                85                  90                  95

Ser Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu Pro
            100                 105                 110

Thr Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly
        115                 120                 125

Asn Val Arg Gly Thr Trp Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp
    130                 135                 140

Glu Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Ala Lys Cys Glu
145                 150                 155                 160

Ala Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe
                165                 170                 175

Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val
            180                 185                 190

Val Cys Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly Asp Gly Cys
        195                 200                 205

Ala Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val
    210                 215                 220

Lys Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
225                 230                 235
<210> 4

<211> 235

<212> PRT

<213> artificial sequence

<220>

<223> trypsin variant 3

<400> 4

Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
            20                  25                  30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Ala Ala Thr Asn Gly Asp
        35                  40                  45

Lys Ser Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Glu Val Leu
    50                  55                  60

Glu Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro
65                  70                  75                  80

Gln Tyr Asp Arg Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys Leu
                85                  90                  95

Ser Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu Pro
            100                 105                 110

Thr Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly
        115                 120                 125

Asn Arg Lys Asp Phe Trp Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp
    130                 135                 140

Glu Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Ala Lys Cys Glu
145                 150                 155                 160

Ala Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe
                165                 170                 175

Leu Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val
            180                 185                 190

Val Cys Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly Asp Gly Cys
        195                 200                 205

Ala Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val
    210                 215                 220

Lys Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
225                 230                 235



<210> 5 

<211> 259 

<212> PRT 

<213> Homo sapiens 



<400> 5 



Ile Val Glu Gly Ser Asp Ala Glu Ile Gly Met Ser Pro Trp Gln Val
1               5                   10                  15

Met Leu Phe Arg Lys Ser Pro Gln Glu Leu Leu Cys Gly Ala Ser Leu
            20                  25                  30

Ile Ser Asp Arg Trp Val Leu Thr Ala Ala His Cys Leu Leu Tyr Pro
        35                  40                  45

Pro Trp Asp Lys Asn Phe Thr Glu Asn Asp Leu Leu Val Arg Ile Gly
    50                  55                  60

Lys His Ser Arg Thr Arg Tyr Glu Arg Asn Ile Glu Lys Ile Ser Met
65                  70                  75                  80

Leu Glu Lys Ile Tyr Ile His Pro Arg Tyr Asn Trp Arg Glu Asn Leu
                85                  90                  95

Asp Arg Asp Ile Ala Leu Met Lys Leu Lys Lys Pro Val Ala Phe Ser
            100                 105                 110

Asp Tyr Ile His Pro Val Cys Leu Pro Asp Arg Glu Thr Ala Ala Ser
        115                 120                 125

Leu Leu Gln Ala Gly Tyr Lys Gly Arg Val Thr Gly Trp Gly Asn Leu
    130                 135                 140

Lys Glu Thr Trp Thr Ala Asn Val Gly Lys Gly Gln Pro Ser Val Leu
145                 150                 155                 160

Gln Val Val Asn Leu Pro Ile Val Glu Arg Pro Val Cys Lys Asp Ser
                165                 170                 175

Thr Arg Ile Arg Ile Thr Asp Asn Met Phe Cys Ala Gly Tyr Lys Pro
            180                 185                 190

Asp Glu Gly Lys Arg Gly Asp Ala Cys Glu Gly Asp Ser Gly Gly Pro
        195                 200                 205

Phe Val Met Lys Ser Pro Phe Asn Asn Arg Trp Tyr Gln Met Gly Ile
    210                 215                 220

Val Ser Trp Gly Glu Gly Cys Asp Arg Asp Gly Lys Tyr Gly Phe Tyr
225                 230                 235                 240

Thr His Val Phe Arg Leu Lys Lys Trp Ile Gln Lys Val Ile Asp Gln
                245                 250                 255

Phe Gly Glu



<210> 6 

<211> 235 

<212> PRT 

<213> Homo sapiens 



<400> 6 



Ile Val Gly Gly Ser Asn Ala Lys Glu Gly Ala Trp Pro Trp Val Val
1               5                   10                  15

Gly Leu Tyr Tyr Gly Gly Arg Leu Leu Cys Gly Ala Ser Leu Val Ser
            20                  25                  30

Ser Asp Trp Leu Val Ser Ala Ala His Cys Val Tyr Gly Arg Asn Leu
        35                  40                  45

Glu Pro Ser Lys Trp Thr Ala Ile Leu Gly Leu His Met Lys Ser Asn
    50                  55                  60

Leu Thr Ser Pro Gln Thr Val Pro Arg Leu Ile Asp Glu Ile Val Ile
65                  70                  75                  80

Asn Pro His Tyr Asn Arg Arg Arg Lys Asp Asn Asp Ile Ala Met Met
                85                  90                  95

His Leu Glu Phe Lys Val Asn Tyr Thr Asp Tyr Ile Gln Pro Ile Cys
            100                 105                 110

Leu Pro Glu Glu Asn Gln Val Phe Pro Pro Gly Arg Asn Cys Ser Ile
        115                 120                 125

Ala Gly Trp Gly Thr Val Val Tyr Gln Gly Thr Thr Ala Asn Ile Leu
    130                 135                 140

Gln Glu Ala Asp Val Pro Leu Leu Ser Asn Glu Arg Cys Gln Gln Gln
145                 150                 155                 160

Met Pro Glu Tyr Asn Ile Thr Glu Asn Met Ile Cys Ala Gly Tyr Glu
                165                 170                 175

Glu Gly Gly Ile Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Leu Met
            180                 185                 190

Cys Gln Glu Asn Asn Arg Trp Phe Leu Ala Gly Val Thr Ser Phe Gly
        195                 200                 205

Tyr Lys Cys Ala Leu Pro Asn Arg Pro Gly Val Tyr Ala Arg Val Ser
    210                 215                 220

Arg Phe Thr Glu Trp Ile Gln Ser Phe Leu His
225                 230                 235













<210> 7 

<211> 275 

<212> PRT 

<213> Bacillus subtilis
5



<400> 7 












lie 


Ala 


nJla 


Glu 


Tyr 


Ala 


Gin Ser 


Val Pro 


X 








5 






10 


i>ys 


Ala 


rlO 


Ala 


Leu 


His 


Ser Gin Gly Tyr 








20 








25 


VdJ- 






lie 


Asp 


Ser Gly He 


Asp Ser 














40 








vjj.y 


Gly Ala 


Ser 


Phe Val 


Pro Ser 




b 0 










55 




Asp 


tsiy 


O A *V-* 


Ser 


His 


Gly Thr His Val Ala 


65 










70 






Asn 


Asn 


Sex 


lie 


Gly 


Val 


Leu Gly Val Ser 










85 






90 


Ala 


Val 


Lys 


val 


Leu 


Asp Ser Thr Gly Ser 








100 








105 


lie 


Asn 


Gly 


He 


Glu 


Trp Ala He 


Ser Asn 






115 








120 




Met 


Ser 


Leu 


Gly 


Gly 


Pro 


Thr Gly 


Ser Thr 




130 










135 




Asp 


Lys 


Ala 


Val 


Ser 


Ser 


Gly He 


Val Val 


145 










150 






Glu 


Gly 


Ser 


Ser 


Gly 


Ser 


Thr Ser 


Thr Val 










165 






170 


Pro 


Ser 


Thr 


He 


Ala 


Val 


Gly Ala 


Val Asn 








180 








185 


Ser 


Phe 


Ser 


Ser 


Ala 


Gly 


Ser Glu 


Leu Asp 






195 








200 




Ser 


lie 


Gin 


Ser 


Thr 


Leu 


Pro Gly Gly Thr 




210 










215 




Thr 


Ser 


Met 


Ala 


Thr 


Pro 


His Val 


Ala Gly 


225 










230 






Ser 


Lys 


His 


Pro 


Thr 


Trp 


Thr Asn 


Ala Gin 










245 






250 


Ser 


Thr 


Ala 


Thr 


Tyr 


Leu 


Gly Asn 


Ser Phe 








260 








265 


lie 


Asn 


Val 













275 



<210> 8 

<211> 320 

<212> PRT 

<213> Murinae gen. sp.



<400> 8 
















Val 


Ala 


Lys 


Arg 


Arg 


Ala 


Lys 


Arg 


Asp 


Val 


l 








5 










10 


Pro 


Lys 


Phe 


Pro 

20 


Gin 


Gin 


Trp 


Tyr 


Leu 
25 


Ser 


Leu 


Asn 


Val 
35 


Lys 


Glu 


Ala 


Trp 


Ala 
40 


Gin 


Gly 


Val 


Val 


Ser 


He 


Leu 


Asp Asp Gly 


He 


Glu 




50 










55 








Ala 


Gly 


Asn 


Tyr 


Asp 


Pro 


Gly Ala 


Ser 


Phe 


65 










70 










Pro 


Asp 


Pro 


Gin 


Pro 
85 


Arg 


Tyr 


Thr 


Gin 


Met 
90 


Thr 


Arg 


Cy3 


Ala 
100 


Gly 


Glu 


Val 


Ala 


Ala 
105 


Val 


Gly 


Val 


Gly 
115 


Val 


Ala 


Tyr 


Asn 


Ala 
120 


Arg 


lie 


Asp 


Gly 
130 


Glu 


Val 


Thr 


Asp 


Ala 
135 


Val 


Glu 


Ala 
Tyr


Gly 


He 


Ser 


Gin 


He 










15 




Thr 


Gly 


Ser 


Asn 


Val 


Lys 








30 






Ser 


His 


Pro 


Asp 


Leu 


Asn 






45 








V7IU 


ml. -w- 

inr 


7V f"i 

Asn 


riO 


iyr 


bin 




60 










Gly 


Thr 


He 


Ala 


Ala 


Leu 


75 










80 


Pro 


Ser 


Ala 


Ser 


Leu 


Tyr 










95 




Gly 


Gin 


Tyr 


Ser 


Trp 


He 








110 






Asn 


Met 


Asp 


Val 


He 


Asn 






12 5 








Ala 


Leu 


Lys 


Thr 


Val 


Val 




140 










Ala 


Ala 


Ala 


Ala 


Gly 


Asn 


155 










160 


Gly 


Tyr 


Pro 


Ala 


Lys 


Tyr 










175 




Ser 


Ser 


Asn 


Gin 


Arg 


Ala 








190 






Val 


Met 


Ala 


Pro 


Gly 


val 






205 








Tyr 


Gly Ala 


Tyr 


Asn 


Gly 




220 










Ala 


Ala 


Ala 


Leu 


He 


Leu 


235 










240 


Val 


Arg 


Asp 


Arg 


Leu 


Glu 










255 




Tyr 


Tyr 


Gly 


Lys 


Gly 


Leu 



270 



Tyr 


Gin 


Glu 


Pro 


Thr 
15 


Asp 


Gly 


Val 


Thr 


Gin 
30 


Arg 


Asp 


Phe 


Thr 


Gly His 


Gly 


He 






45 








Lys 


Asn 
60 


His 


Pro 


Asp 


Leu 


Asp 


Val 


Asn 


Asp 


Gin 


Asp 


75 










80 


Asn 


Asp 


Asn 


Arg 


His 
95 


Gly 


Ala 


Asn 


Asn Gly Val 


Cys 








110 






Gly 


Gly Val 


Arg 


Met 


Leu 






125 








Arg 


Ser 


Leu 


Gly 


Leu 


Asn 



14 0 
Pro


Asn 


His 


He 


His He Tvr 


Ser 


Ala 


Ser 


Trp 


Glv 


Pro 


Glu 


A nn 




145 








150 








155 












Glv 


Lvg 

1 m J * — * 


Thr 


Val 


Asn Glv Prn 


Ala Arg Leu Ala 




bill 


AJia 


■fne 


Fne 










165 






170 










1 t; 
j. / _> 




Arci 


Glv 


Val 


Ser 


Gin Glv At-ct 


Gly Gly Leu Gly 


Car 
OCX. 


X Xc 




vaj. 


Trp 








180 






185 














Ala 


Ser 


Glv 


Asn 


Glv Glv Ara 


Glu 


His 


Asp 


Ser 


V- y a 


7\ an 


V— _y £3 


nop 


ijxy 






195 






200 










205 








Tyr 


Thr 


Asn 


Ser 


He Tyr Thr 


Leu 


Ser 


He 


Ser 


Ser 


Ala 


Thr 


Gin 


Phe 




210 






215 










220 










Gly 


Asn 


Val 


Pro 


Trp Tyr Ser 


Glu 


Ala 


Cys 


Ser 


Ser 


Thr 


Leu 


Ala 


Thr 


225 








230 








235 










240 


Thr 


Tyr 


Ser 


Ser 


Gly Asn Gin 


Asn 


Glu 


Lys 


Gin 


He 


Val 


Thr 


Thr 


Asp 










245 






250 










255 


Leu 


Arg 


Gin 


Lys 


Cys Thr Glu 


Ser 


His 


Thr 


Gly 


Thr 


Ser 


Ala 


Ser 


Ala 








260 






265 










270 






Pro 


Leu 


Ala 


Ala 


Gly He He 


Ala 


Leu 


Thr 


Leu 


Glu 


Ala 


Asn 


Lys 


Asn 






275 






280 










285 








Leu 


Thr 


Trp 


Arg 


Asp Met Gin 


His 


Leu 


Val 


Val 


Gin 


Thr 


Ser 


Lys 


Pro 




290 






295 










300 










Ala 


His 


Leu 


Asn 


Ala Asp Asp 


Trp 


Ala 


Thr 


Asn 


Gly 


Val 


Gly Arg 


Lys 



305 310 315 320 



<210> 9 

<211> 330 

<212> PRT 

<213> Homo sapiens 



<400> 9 



Glu Lys Glu Arg Ser Lys Arg Ser Ala Leu Arg Asp Ser Ala Leu Asn
1               5                   10                  15

Leu Phe Asn Asp Pro Met Trp Asn Gln Gln Trp Tyr Leu Gln Asp Thr
            20                  25                  30

Arg Met Thr Ala Ala Leu Pro Lys Leu Asp Leu His Val Ile Pro Val
        35                  40                  45

Trp Gln Lys Gly Ile Thr Gly Lys Gly Val Val Ile Thr Val Leu Asp
    50                  55                  60

Asp Gly Leu Glu Trp Asn His Thr Asp Ile Tyr Ala Asn Tyr Asp Pro
65                  70                  75                  80

Glu Ala Ser Tyr Asp Phe Asn Asp Asn Asp His Asp Pro Phe Pro Arg
                85                  90                  95

Tyr Asp Pro Thr Asn Glu Asn Lys His Gly Thr Arg Cys Ala Gly Glu
            100                 105                 110

Ile Ala Met Gln Ala Asn Asn His Lys Cys Gly Val Gly Val Ala Tyr
        115                 120                 125

Asn Ser Lys Val Gly Gly Ile Arg Met Leu Asp Gly Ile Val Thr Asp
    130                 135                 140

Ala Ile Glu Ala Ser Ser Ile Gly Phe Asn Pro Gly His Val Asp Ile
145                 150                 155                 160

Tyr Ser Ala Ser Trp Gly Pro Asn Asp Asp Gly Lys Thr Val Glu Gly
                165                 170                 175

Pro Gly Arg Leu Ala Gln Lys Ala Phe Glu Tyr Gly Val Lys Gln Gly
            180                 185                 190

Arg Gln Gly Lys Gly Ser Ile Phe Val Trp Ala Ser Gly Asn Gly Gly
        195                 200                 205

Arg Gln Gly Asp Asn Cys Asp Cys Asp Gly Tyr Thr Asp Ser Ile Tyr
    210                 215                 220

Thr Ile Ser Ile Ser Ser Ala Ser Gln Gln Gly Leu Ser Pro Trp Tyr
225                 230                 235                 240

Ala Glu Lys Cys Ser Ser Thr Leu Ala Thr Ser Tyr Ser Ser Gly Asp
                245                 250                 255

Tyr Thr Asp Gln Arg Ile Thr Ser Ala Asp Leu His Asn Asp Cys Thr
            260                 265                 270

Glu Thr His Thr Gly Thr Ser Ala Ser Ala Pro Leu Ala Ala Gly Ile
        275                 280                 285

Phe Ala Leu Ala Leu Glu Ala Asn Pro Asn Leu Thr Trp Arg Asp Met
    290                 295                 300

Gln His Leu Val Val Trp Thr Ser Glu Tyr Asp Pro Leu Ala Asn Asn
305                 310                 315                 320

Pro Gly Trp Lys Lys Asn Gly Ala Gly Leu
                325                 330



<210> 10 

<211> 297 

<212> PRT 

<213> Homo sapiens 



<400> 10 




























Asn Thr His Pro Cys Gln Ser Asp Met Asn Ile Glu Gly Ala Trp Lys
1               5                   10                  15

Arg Gly Tyr Thr Gly Lys Asn Ile Val Val Thr Ile Leu Asp Asp Gly
            20                  25                  30

Ile Glu Arg Thr His Pro Asp Leu Met Gln Asn Tyr Asp Ala Leu Ala
        35                  40                  45

Ser Cys Asp Val Asn Gly Asn Asp Leu Asp Pro Met Pro Arg Tyr Asp
    50                  55                  60

Ala Ser Asn Glu Asn Lys His Gly Thr Arg Cys Ala Gly Glu Val Ala
65                  70                  75                  80

Ala Ala Ala Asn Asn Ser His Cys Thr Val Gly Ile Ala Phe Asn Ala
                85                  90                  95

Lys Ile Gly Gly Val Arg Met Leu Asp Gly Asp Val Thr Asp Met Val
            100                 105                 110

Glu Ala Lys Ser Val Ser Phe Asn Pro Gln His Val His Ile Tyr Ser
        115                 120                 125

Ala Ser Trp Gly Pro Asp Asp Asp Gly Lys Thr Val Asp Gly Pro Ala
    130                 135                 140

Pro Leu Thr Arg Gln Ala Phe Glu Asn Gly Val Arg Met Gly Arg Arg
145                 150                 155                 160

Gly Leu Gly Ser Val Phe Val Trp Ala Ser Gly Asn Gly Gly Arg Ser
                165                 170                 175

Lys Asp His Cys Ser Cys Asp Gly Tyr Thr Asn Ser Ile Tyr Thr Ile
            180                 185                 190

Ser Ile Ser Ser Thr Ala Glu Ser Gly Lys Lys Pro Trp Tyr Leu Glu
        195                 200                 205

Glu Cys Ser Ser Thr Leu Ala Thr Thr Tyr Ser Ser Gly Glu Ser Tyr
    210                 215                 220

Asp Lys Lys Ile Ile Thr Thr Asp Leu Arg Gln Arg Cys Thr Asp Asn
225                 230                 235                 240

His Thr Gly Thr Ser Ala Ser Ala Pro Met Ala Ala Gly Ile Ile Ala
                245                 250                 255

Leu Ala Leu Glu Ala Asn Pro Phe Leu Thr Trp Arg Asp Val Gln His
            260                 265                 270

Val Ile Val Arg Thr Ser Arg Ala Gly His Leu Asn Ala Asn Asp Trp
        275                 280                 285

Lys Thr Asn Ala Ala Gly Phe Lys Val
    290                 295

















<210> 11 

<211> 328 

<212> PRT 

<213> Homo sapiens 



<400> 11
Thr Leu Val Asp
1 

Phe Gly Thr Ile

20 

Phe Asp Thr Gly 

35 

Ser Leu Ala Cys 
50 

Thr Tyr Gln Ser
65

Ser Met Thr Gly 

Ser Asp Thr Asn 

100 

Phe Leu Tyr Tyr 
115 

Ser Ile Ser Ser
130

Gln Gly Leu Val
145 

Asp Lys Ser Gly 

Tyr Thr Gly Ser 

180 

Gln Ile Thr Val
195 

Ala Glu Gly Cys 
210 

Gly Pro Thr Ser 
225 

Glu Asn Ser Asp 

Leu Pro Asp Ile

260 

Pro Ser Ala Tyr 
275 

Gln Gly Met Asn
290

Asp Val Phe Ile
305

Gln Val Gly Leu



Glu Gln Pro Leu
5

Gly Ile Gly Thr

Ser Ser Asn Leu 

40 

Thr Asn His Asn 
55 

Thr Ser Glu Thr 
70 

Ile Leu Gly Tyr
85

Gln Ile Phe Gly

Ala Pro Phe Asp 

120 

Ser Gly Ala Thr 
135 

Ser Gln Asp Leu
150

Ser Val Val Ile
165 

Leu Asn Trp Val 

Asp Ser Ile Thr

200

Gln Ala Ile Val
215

Pro Ile Ala Asn
230 

Gly Asp Met Val 
245 

Val Phe Thr Ile

Ile Leu Gln Ser

280 

Val Pro Thr Glu 
295 

Arg Gln Tyr Phe
310 

Ala Pro Val Ala 
325 



Glu Asn Tyr Leu 
10 

Pro Ala Gln Asp
25 

Trp Val Pro Ser 

Arg Phe Asn Pro 

60 

Val Ser Ile Thr
75

Asp Thr Val Gln
90 

Leu Ser Glu Thr 
105 

Gly Ile Leu Gly

Pro Val Phe Asp 

140 

Phe Ser Val Tyr 
155 

Phe Gly Gly Ile
170 

Pro Val Thr Val 
185 

Met Asn Gly Glu 

Asp Thr Gly Thr 

220 

Ile Gln Ser Asp
235 

Val Ser Cys Ser 
250 

Asn Gly Val Gln
265 

Glu Gly Ser Cys 

Ser Gly Glu Leu 

300 

Thr Val Phe Asp 
315 





Met 


Glu 


Tyr






15 




Phe 


Thr 


Val 


Val 




30 






Val 


Tyr 


Cys 


Ser 


45 








Glu 


Asp 


Ser 


Ser 


Tyr


Gly Thr 


Gly








80 


V Q A- 


Gly 


Gly 








95 






Pro 


Gly 


O 




110 






Leu 


Ala 


Tyr 


Pro 


_L 








r\C3 Li 


Ile


Trp 


nail 


T.pi 1 
iJCU 


Ser 


Ala 










160


Asp 


Ser 


Ser 


Tyr 






175 




blU 


Gly 


Tyr 


Trp 




190 






Thr 


Ile


Ala 


Cys 


205








Ser 


Leu 


Leu 


Thr 


Ile


Gly Ala 


Ser 








240 


Ala 


Ile


Ser 


Ser 






255 




Tyr 


Pro 


Val 


Pro 




270 






Ile


Ser 


Gly 


Phe 


285 








Trp 


Ile


Leu 


Gly 


Arg 


Ala 


Asn 


Asn 








320 



<210> 12 

<211> 358 

<212> PRT 

<213> Homo sapiens 



<400> 12 

Glu Met Val Asp 

1 

Glu Met Thr Val 

20 

Thr Gly Ser Ser 
35 

His Arg Tyr Tyr 
50 

Lys Gly Val Tyr 
65 

Gly Thr Asp Leu 
Ala Asn Ile Ala



Asn Leu Arg Gly 
5 

Gly Ser Pro Pro 

Asn Phe Ala Val 

40 

Gln Arg Gln Leu
55 

Val Pro Tyr Thr 
70 

Val Ser Ile Pro
85

Ala Ile Thr Glu



Lys Ser Gly Gln
10

Gln Thr Leu Asn

25 

Gly Ala Ala Pro 

Ser Ser Thr Tyr 

60 

Gln Gly Lys Trp
75 

His Gly Pro Asn 
90 

Ser Asp Lys Phe 



Gly Tyr Tyr Val 
15 

Ile Leu Val Asp
30 

His Pro Phe Leu 
45 

Arg Asp Leu Arg 

Glu Gly Glu Leu 

80 

Val Thr Val Arg 
95 

Phe Ile Asn Gly
100

Ser Asn Trp Glu 
115 

Pro Asp Asp Ser 
130 

His Val Pro Asn 
145 

Leu Asn Gln Ser

Gly Gly Ile Asp

180 

Ile Arg Arg Glu
195

Asn Gly Gln Asp
210

Ser Ile Val Asp
225 

Phe Glu Ala Ala 

Phe Pro Asp Gly 

260 

Gly Thr Thr Pro 
275 

Gly Glu Val Thr 

290 

Tyr Leu Arg Pro 
305 

Lys Phe Ala Ile

Ile Met Glu Gly

340 

Gly Phe Ala Val 
355 



Gly Ile Leu Gly

120 

Leu Glu Pro Phe 
135 

Leu Phe Ser Leu 
150 

Glu Val Leu Ala 
165 

His Ser Leu Tyr 

Trp Tyr Tyr Glu 

200 

Leu Lys Met Asp 
215 

Ser Gly Thr Thr 
230 

Val Lys Ser Ile

245 

Phe Trp Leu Gly 

Trp Asn Ile Phe

280 

Asn Gln Ser Phe
295 

Val Glu Asp Val 
310 

Ser Gln Ser Ser
325 

Phe Tyr Val Val 
Ser Ala 



105 








Leu 


Ala 


Tyr 


Ala 


Phe 


Asp 


Ser 


Leu 








140 


Gln


Leu 


Cys 


Gly 






155 




Ser 


Val 


Gly 


Gly 




170 






Thr Gly 


Ser 


Leu 


185 








Val 


Ile


Ile


Val 


Cys 


Lys 


Glu 


Tyr 








220 


Asn 


Leu 


Arg 


Leu 






235 




Lys 


Ala 


Ala 


Ser 




250 






Glu 


Gln


Leu 


Val 


265 








Pro 


Val 


Ile


Ser 


Arg 


Ile


Thr 


Ile








300 


Ala 


Thr 


Ser 


Gln






315 




Thr 


Gly 


Thr 


Val 




330 






Phe 


Asp 


Arg 


Ala 


345 











110 






Glu 


Ile


Ala 


Arg 


125 








Val 


Lys 


Gln


Thr 


Ala 


Gly 


Phe 


Pro 








160 


Ser 


Met 


Ile


Ile






175 




Trp 


Tyr 


Thr 


Pro 




190 






Arg 


Val 


Glu 


Ile


205 








Asn 


Tyr 


Asp 


Lys 


Pro 


Lys 


Lys 


Val 








240 


Ser 


Thr 


Glu 


Lys 






255 




Cys 


Trp 


Gln


Ala 




270 






Leu 


Tyr 


Leu 


Met 


285 








Leu 


Pro 


Gln


Gln


Asp 


Asp 


Cys 


Tyr 








320 


Met 


Gly 


Ala 


Val






335 




Arg 


Lys 


Arg 


Ile




350 







<210> 13 

<211> 351 

<212> PRT 

<213> Homo sapiens 



<400> 13 

Pro Ala Val Thr 

1

Asp Ala Gln Tyr

20 

Phe Thr Val Val 
35 

Ile His Cys Lys
50 

Asn Ser Asp Lys 
65 

Ile His Tyr Gly

Val Ser Val Pro 

100 

Val Lys Val Glu 
115 

Ile Thr Phe Ile
130

Pro Arg Ile Ser
145

Gln Gln Lys Leu



Glu Gly Pro Ile
5

Tyr Gly Glu Ile

Phe Asp Thr Gly 

40 

Leu Leu Asp Ile
55 

Ser Ser Thr Tyr 
70 

Ser Gly Ser Leu 
85 

Cys Gln Ser Ala

Arg Gln Val Phe

120 

Ala Ala Lys Phe 
135 

Val Asn Asn Val 
150 

Val Asp Gln Asn
165 



Pro Glu Val Leu 
10 

Gly Ile Gly Thr
25 

Ser Ser Asn Leu 

Ala Cys Trp Ile

60 

Val Lys Asn Gly 
75 

Ser Gly Tyr Leu 
90 

Ser Ser Ala Ser 
105 

Gly Glu Ala Thr 

Asp Gly Ile Leu

140 

Leu Pro Val Phe 
155 

Ile Phe Ser Phe
170 



Lys 


Asn 


Tyr 


Met 






15 




Pro 


Pro 


Gln


Cys 




30 






Trp 


Val 


Pro 


Ser 


45 








His 


His 


Lys 


Tyr 


Thr 


Ser 


Phe 


Asp 








80 


Ser 


Gln


Asp 


Thr 






95 




Ala 


Leu 


Gly 


Gly 




110 






Lys 


Gln


Pro 


Gly 


125 








Gly Met 


Ala 


Tyr 


Asp 


Asn 


Leu 


Met 








160 


Tyr 


Leu 


Ser 


Arg 






175 
Asp


Pro 


Asp


Ala




Pro 




t»j.y 








iOU 










Ser 


Lys 


Tyr 


Tyr 


T ■ ■ m-m 

Lys 


Gly 


Ser 


Leu 






IOC 

195










i ri r» 

200 


Ala 


Tyr 


Trp 


/IT _ 

Gln


Val 


TT -t — 

His 


T 

Leu 


_ ____ 

Asp 




210 














Thr 


Leu 


Cys 


Lys 


Glu


Gly 


Cys 


Glu


225 
















Leu 


Met 


Val 


Gly 


Pro 


Val 


Asp 


Glu 










245 








Gly Ala 


Val 


Pro 


Leu 


Ile


Gln


Gly 








260 










Val 


Ser 


Thr 


Leu 


Pro 


Ala 


Ile


Thr 






275 










280 


Lys 


Leu 


Ser 


Pro 


Glu 


Asp 


Tyr 


Thr 




290 










295 




Thr 


Leu 


Cys 


Leu 


Ser 


Gly 


Phe 


Met 


305 










310 






Gly 


Pro 


Leu 


Trp 


Ile


Leu 


Gly 


Asp 










325 








Val 


Phe 


Asp 


Arg 


Asp 


Asn 


Asn 


Arg 



340 



<210> 14 

<211> 305 

<212> PRT 

<213> Homo sapiens 



<400> 14 



Met 


Leu 


Glu 


Ala 


Asp 


Asp 


Gln


Gly 


1 








5 








Asp 


Ser 


Ala 


Asn 


Glu 


Asp 


Ser 


Val 








20 










Phe 


Val 


Pro 


Ser 


Leu 


Phe 


Ser 


Lys 






35 










40 


Ser 


Ile


Lys 


Thr 


Thr 


Arg 


Asp 


Arg 




50 










55 




Asn 


Phe 


Glu 


Lys 


Leu 


Gly 


Lys 


Cys 


65 










70 






Asp 


Lys 


Val 


Thr 


Gly 


Met 


Gly 


Val 










85 








Glu 


Ala 


Leu 


Phe 


Lys 


Cys 


Phe 


Arg 








100 










Tyr 


Asn 


Asp 


Cys 


Ser 


Cys 


Ala 


Lys 






115 










120 


Ser 


Glu 


Glu 


Asp 


His 


Thr 


Asn 


Ala 




130 










135 




Ser 


His 


Gly 


Glu 


Glu 


Asn 


Val 


Ile


145 










150 






Ile


Lys 


Asp 


Leu 


Thr 


Ala 


His 


Phe 










165 








Leu 


Glu 


Lys


Pro 


Lys 


Leu 


Phe 


Phe 








180 










Leu 


Asp 


Asp 


Gly 


Ile


Gln


Ala 


Asp 






195 










200 


Ala 


Asn 


Pro 


Arg 


Tyr 


Lys 


Ile


Pro 




210 










215 




Tyr 


Ser 


Thr 


Val


Pro 


Gly 


Tyr 


Tyr 


225 










230 






Ser 


Trp 


Phe 


Val 


Gln


Ala 


Leu 


Cys 










245 








Asp 


Leu 


Glu 


Ile


Met 


Gln


Ile


Leu 



Glu


Leu 


Met 


Leu 


Gly Gly 


Thr 


Asp 


toe 
185 










190 






Ser 


Tyr 


Leu 


Asn 


Val 


Thr 


Arg 


Lys 










205 








Gln


Val 


Glu 


Val 


Ala 


Ser 


Gly 


Leu 








220 










Ala 


Ile


Val 


Asp 


Thr 


Gly Thr 


Ser 






<s3d 










O A t\ 

240


Val 


Arg 


Glu 


Leu 


Gln


Lys 


Ala 


Ile




250 










255 




Glu 


Tyr 


Met 


Ile


Pro 


Cys 


Glu 


Lys 


265 










270 






Leu 


Lys 


Leu 


Gly 


Gly 


Lys 


Gly 


Tyr 










285 








Leu 


Lys 


Val 


Ser 


Gln


Ala 


Gly 


Lys 








300 










Gly 


Met 


Asp 


Ile


Pro 


Pro 


Pro 


Ser 






315 










320 


Val 


Phe 


Ile


Gly Arg 


Tyr 


Tyr 


Thr 




330 










335 




Val 


Gly 


Phe 


Ala 


Glu 


Ala 


Ala 





345 350 



Cys 


Ile


Glu 


Glu 


Gln


Gly 


Val 


Glu 




10 










15 




Asp 


Ala 


Lys 


Pro 


Asp 


Arg 


Ser 


Ser 


25 










30 






Lys 


Lys 


Lys Asn 


Val 


Thr 


Met 


Arg 










45 








Val 


Pro 


Thr 


Tyr 


Gln


Tyr 


Asn 


Met 








60 










Ile


Ile


Ile


Asn 


Asn 


Lys 


Asn 


Phe 






75 










80 


Arg 


Asn 


Gly 


Thr 


Asp 


Lys 


Asp 


Ala 




90 










95 




Ser 


Leu 


Gly 


Phe 


Asp 


Val 


Ile


Val 


105 










110 






Met 


Gln


Asp 


Leu 


Leu 


Lys 


Lys 


Ala 










125 








Ala 


Cys 


Phe 


Ala 


Cys 


Ile


Leu 


Leu 








140 










Tyr 


Gly 


Lys 


Asp 


Gly 


Val 


Thr 


Pro 






155 










160 


Arg 


Gly 


Asp 


Arg 


Ser 


Lys 


Thr 


Leu 




170 










175 




Ile


Gln


Ala 


Cys 


Arg 


Gly Thr 


Glu 


185 










190 






Ser 


Gly 


Pro 


Ile


Asn 


Asp 


Thr 


Asp 










205 








Val 


Glu 


Ala 


Asp 


Phe 


Leu 


Phe 


Ala 








220 










Ser 


Trp 


Arg 


Ser 


Pro 


Gly 


Arg 


Gly 






235 










240 


Ser 


Ile


Leu 


Glu 


Glu 


His 


Gly 


Lys 




250 










255 




Thr 


Arg 


Val 


Asn 


Asp 


Arg 


Val 


Ala 
260

Arg His Phe Glu Ser Gln Ser Asp
275 280
Gln Ile Pro Cys Val Val Ser Met
290 295

Gln
305 



265 270 
Asp Pro His Phe His Glu Lys Lys 

285 

Leu Thr Lys Glu Leu Tyr Phe Ser 

300 



<210> 15 

<211> 262 

<212> PRT 

<213> Streptomyces sp. K15 



<400> 15 




























Val 


Thr 


Lys 


Pro 


Thr 


Ile


Ala 


Ala 


Val 


Gly 


Gly Tyr 


Ala 


Met 


Asn 


Asn 


1 








5 










10 










15 




Gly Thr 


Gly 


Thr 


Thr 


Leu 


Tyr 


Thr 


Lys 


Ala 


Ala 


Asp 


Thr 


Arg 


Arg 


Ser 








20 










25 










30 






Thr 


Gly 


Ser 
35 


Thr 


Thr 


Lys 


Ile


Met 
40 


Thr 


Ala 


Lys 


Val


Val 
45 


Leu 


Ala 


Gln


Ser 


Asn 
50 


Leu 


Asn 


Leu 


Asp 


Ala 

55 


Lys 


Val 


Thr 


Ile


Gln
60 


Lys 


Ala 


Tyr 


Ser 


Asp 


Tyr 


Val 


Val 


Ala 


Asn 


Asn 


Ala 


Ser 


Gln


Ala 


His 


Leu 


Ile


Val 


Gly 


65 










70 










75 










80 


Asp 


Lys 


Val 


Thr 


Val 
85 


Arg 


Gln


Leu 


Leu 


Tyr 
90 


Gly 


Leu 


Met 


Leu 


Pro 
95 


Ser 


Gly 


Cys 


Asp 


Ala 
100 


Ala 


Tyr 


Ala 


Leu 


Ala 
105 


Asp 


Lys 


Tyr 


Gly 


Ser 
110 


Gly 


Ser 


Thr 


Arg 


Ala 


Ala 


Arg 


Val 


Lys 


Ser 


Phe 


Ile


Gly Lys 


Met 


Asn 


Thr 


Ala 






115 










120 










125 








Ala 


Thr 
130 


Asn 


Leu 


Gly 


Leu 


His 
135 


Asn 


Thr 


His 


Phe 


Asp 
140 


Ser 


Phe 


Asp 


Gly 


Ile


Gly 


Asn 


Gly 


Ala 


Asn 


Tyr 


Ser 


Thr 


Pro 


Arg 


Asp 


Leu 


Thr 


Lys 


Ile


145 










150 










155 










160 


Ala 


Ser 


Ser 


Ala 


Met 
165 


Lys 


Asn 


Ser 


Thr 


Phe 
170 


Arg 


Thr 


Val 


Val 


Lys 
175 


Thr 


Lys 


Ala 


Tyr 


Thr 
180 


Ala 


Lys 


Thr 


Val 


Thr 
185 


Lys 


Thr 


Gly 


Ser 


Ile
190 


Arg 


Thr 


Met 


Asp 


Thr 
195 


Trp 


Lys 


Asn 


Thr 


Asn 
200 


Gly 


Leu 


Leu 


Ser 


Ser 
205 


Tyr 


Ser 


Gly 


Ala 


Ile


Gly 


Val 


Lys 


Thr 


Gly 


Ser Gly 


Pro 


Glu 


Ala 


Lys 


Tyr 


Cys 


Leu 




210 










215 










220 










Val 


Phe 


Ala 


Ala 


Thr 


Arg Gly Gly Lys 


Thr 


Val 


Ile


Gly Thr 


Val 


Leu 


225 










230 










235 










240 


Ala 


Ser 


Thr 


Ser 


Ile
245 


Pro 


Ala 


Arg 


Glu


Ser 
250 


Asp 


Ala 


Thr 


Lys 


Ile
255 


Met 


Asn 


Tyr 


Gly 


Phe 
260 


Ala 


Leu 























<210> 16 
<211> 256 
<212> PRT 

<213> Human cytomegalovirus 
<400> 16 

Met Thr Met Asp Glu Gln Gln Ser
1 5 
Gly Gly Phe Leu Ala Arg Tyr Asp 

20 

Leu Leu Pro Arg Asp Val Val Glu 
35 40 



Gln Ala Val Ala Pro Val Tyr Val

10 15
Gln Ser Pro Asp Glu Ala Glu Leu
25 30
His Trp Leu His Ala Gln Gly Gln

45 
Gly


Gln


Pro 


Ser 




50 






Asp 


Thr 


Ala 


Val 


65 








Gly


Leu 


Phe 


Cys


Val 


Arg 


Arg


Ala 








100 


Ser 


Pro 


Leu 


Gln






115 










Cor 

OCX 




130 

V 






Thr 


Ser 


Leu 


Ser 


145 








Cys 


Ser 


Val 


Gly 


Pro 


Glu 


Trp 


Val 








180 


Asp 


Gly 


Leu 


Arg 






195 




Ala 


Ser 


Gly 


Asp 




210 






Ser 


Val 


Asp 


Ala 


225 








Asp 


Lys 


Gln


Leu 



Leu Ser Val Ala 
55 

Val Gly His Val 
70 

Leu Gly Cys Val 
85 

Ser Glu Lys Ser 

Pro Asp Lys Val 

120 

Leu Ser Ser Arg 
135 

Gly Ser Glu Thr 
150 

Arg Arg Arg Gly 
165 

Thr Gln Arg Phe

Ala Gln Trp Gln

200 

Pro Phe Arg Ser 
215 

Leu Tyr Ile Arg
230 

Val Gly Val Thr 
245 



Leu Pro Leu Asn 

60 

Ala Ala Met Gln
75 

Thr Ser Pro Arg 
90 

Glu Leu Val Ser 
105 

Val Glu Phe Leu 

Arg Cys Asp Asp 

140 

Thr Pro Phe Lys 
155 

Thr Leu Ala Val 
170 

Pro Asp Leu Thr 
185 

Arg Cys Gly Ser 

Asp Ser Tyr Gly 

220 

Glu Arg Leu Pro 

235 

Glu Arg Glu Ser 
250 



Ile Asn His Asp

Ser Val Arg Asp 

80 

Phe Leu Glu Ile
95 

Arg Gly Pro Val 
110 

Ser Gly Ser Tyr 
125 

Val Glu Gln Ala

His Val Ala Leu 

160 

Tyr Gly Arg Asp 
175 

Ala Ala Asp Arg 
190 

Thr Ala Val Asp 
205 

Leu Leu Gly Asn 

Lys Leu Arg Tyr 

240 

Tyr Val Lys Ala 
255 



<210> 17 

<211> 248 

<212> PRT 

<213> Escherichia coli 



<400> 17 

Val Arg Ser Phe 

1 

Met Pro Thr Leu 

20 

Tyr Gly Ile Lys
35 

His Pro Lys Arg 
50 

Lys Leu Asp Tyr 
65 

Thr Tyr Asp Pro 

Ser Gly Gln Ala

100 

Glu Pro Ser Asp 
115 

Ala Thr Ser Gly 
130 

Gly Ile Arg Leu
145

Arg Ile Leu Thr

Gln Gln Pro Gly

180 

Tyr Phe Met Met 
195 

Trp Gly Phe Val 
210 

Trp Met Ser Phe 



Ile


Tyr 


Glu 


Pro 


5 








Leu 


Ile


Gly Asp 


Asp 


Pro 


Ile


Tyr 








40 


Gly 


Asp 


Ile


Val 






55 




Ile


Lys 


Arg 


Ala 




70 






Val 


Ser 


Lys 


Glu 


85 








Cys 


Glu 


Asn 


Ala 


Phe 


Val 


Gln


Thr 








120 


Phe 


Phe 


Glu 


Val 






135 




Ser 


Glu 


Arg 


Lys 




150 






Val 


Pro 


Ile


Ala 


165 








Gln


Gln


Leu 


Ala 


Gly 


Asp 


Asn 


Arg 








200 


Pro 


Glu 


Ala 


Asn 






215 




Asp 


Lys 


Gln


Glu 



Phe Gln Ile Pro
10

Phe Ile Leu Val
25

Gln Lys Thr Leu

Val Phe Lys Tyr 

60 

Val Gly Leu Pro 
75 

Leu Thr Ile Gln
90 

Leu Pro Val Thr 
105 

Phe Ser Arg Arg 

Pro Lys Asn Glu 

140 

Glu Thr Leu Gly 
155 

Gln Asp Gln Val
170

Thr Trp Ile Val
185 

Asp Asn Ser Ala 

Leu Val Gly Arg 

220 

Gly Glu Trp Pro 



Ser Gly Ser Met 
15 

Glu Lys Phe Ala 
30 

Ile Glu Thr Gly
45 

Pro Glu Asp Pro 

Gly Asp Lys Val 

80 

Pro Gly Cys Ser 
95 

Tyr Ser Asn Val 
110 

Asn Gly Gly Glu 
125 

Thr Lys Glu Asn 

Asp Val Thr His 

160 

Gly Met Tyr Tyr 
175 

Pro Pro Gly Gln
190 

Asp Ser Arg Tyr 
205 

Ala Thr Ala Ile
Thr Gly Leu Arg
225 230 235 240

Leu Ser Arg Ile Gly Gly Ile His

245 



<210> 18 

<211> 317 

<212> PRT 

<213> Serratia marcescens 



<400> 18 


























Met 


Glu 


Gln


Leu 


Arg 


Gly 


Leu 


Tyr 


Pro 


Pro 


Leu 


Ala 


Ala Tyr Asp 


Ser 


1 








5 










10 








15 




Gly 


Trp 


Leu 


Asp 


Thr 


Gly 


Asp Gly His 


Arg 


Ile


Tyr 


Trp Glu 


Leu 


Ser 








20 










25 








30 






Gly 


Asn 


Pro 


Asn 


Gly 


Lys 


Pro 


Ala 


Val 


Phe 


Ile


His 


Gly Gly Pro Gly 






35 










40 










45 






Gly 


Gly 


Ile


Ser 


Pro 


His 


His 


Arg 


Gln


Leu 


Phe 


Asp 


Pro Glu 


Arg 


Tyr 




50 










55 










60 








Lys 


Val 


Leu 


Leu 


Phe 


Asp 


Gln


Arg 


Gly 


Cys 


Gly 


Arg 


Ser Arg 


Pro 


His 


65 










70 










75 








80 


Ala 


Ser 


Leu 


Asp 


Asn 


Asn 


Thr 


Thr 


Trp 


His 


Leu 


Val 


Ala Asp 


Ile


Glu 










85 










90 








95 




Arg 


Leu 


Arg 


Glu 


Met 


Ala 


Gly Val


Glu 


Gln


Trp 


Leu 


Val Phe 


Gly Gly 








100 










105 








110 






Ser 


Trp 


Gly 


Ser 


Thr 


Leu 


Ala 


Leu 


Ala 


Tyr 


Ala 


Gln


Thr His 


Pro 


Glu 






115 










120 










125 






Arg 


Val 


Ser 


Glu 


Met 


Val 


Leu 


Arg Gly 


Ile


Phe 


Thr 


Leu Arg 


Lys 


Gln




130 










135 










140 








Arg 


Leu 


His 


Trp 


Tyr 


Tyr 


Gln Asp Gly Ala


Ser 


Arg 


Phe Phe 


Pro 


Glu 


145 










150 










155 








160 


Lys 


Trp 


Glu 


Arg 


Val 


Leu 


Ser 


Ile


Leu 


Ser 


Asp 


Asp 


Glu Arg 


Lys 


Asp 










165 










170 








175 




Val 


Ile


Ala 


Ala 


Tyr 


Arg 


Gln


Arg 


Leu 


Thr 


Ser 


Ala 


Asp Pro 


Gln


Val 








180 










185 








190 






Gln


Leu 


Glu 


Ala 


Ala 


Lys 


Leu 


Trp 


Ser 


Val 


Trp 


Glu Gly Glu 


Thr 


Val 






195 










200 










205 






Thr 


Leu 


Leu 


Pro 


Ser 


Arg 


Glu 


Ser 


Ala 


Ser 


Phe 


Gly Glu Asp 


Asp 


Phe 




210 










215 










220 








Ala 


Leu 


Ala 


Phe 


Ala 


Arg 


Ile


Glu 


Asn 


His 


Tyr 


Phe 


Thr His 


Leu 


Gly 


225 










230 










235 








240 


Phe 


Leu 


Glu 


Ser 


Asp 


Asp 


Gln


Leu 


Leu 


Arg 


Asn 


Val 


Pro Leu 


Ile


Arg 










245 










250 








255 




His 


Ile


Pro 


Ala 


Val 


Ile


Val 


His 


Gly 


Arg 


Tyr 


Asp 


Met Ala 


Cys 


Gln








260 










265 








270 






Val 


Gln


Asn 


Ala 


Trp 


Asp 


Leu 


Ala 


Lys 


Ala 


Trp 


Pro 


Glu Ala 


Glu 


Leu 






275 










280 










285 






His 


Ile


Val 


Glu 


Gly 


Ala 


Gly His 


Ser 


Tyr 


Asp 


Glu 


Pro Gly 


Ile


Leu 




290 










295 










300 








His 


Gln


Leu 


Met 


Ile


Ala 


Thr 


Asp 


Arg 


Phe 


Ala 


Gly 


Lys 







305 310 315 



<210> 19 
<211> 229 
<212> PRT 

<213> Escherichia coli 
<400> 19 

Met Glu Leu Leu Leu Leu Ser Asn Ser Thr Leu Pro Gly Lys Ala Trp 
1 5 10 15 

Leu Glu His Ala Leu Pro Leu Ile Ala Asn Gln Leu Asn Gly Arg Arg

20 25 30 
Ser Ala Val Phe
35 

Tyr Thr Asp Lys 
50 

Thr Gly Ile His
65

Glu Ile Ile Ile

Ser Arg Glu Arg 

100 

Gly Ala Leu Tyr
115

Thr Ile Arg Thr
130 

Asp Ala Leu Asp 
145 

Ala Leu Pro Glu 

Glu Leu Leu Val 

180

Gly Asn Trp Ile
195 

Asn Thr Thr Trp 
210 

Ala Gly His Arg 
225 



Ile Pro Phe Ala

40 

Thr Ala Glu Val 
55 

Arg Val Ala Asp 
70 

Val Gly Gly Gly 
85 

Gly Leu Leu Ala 

Ile Gly Trp Ser

120 

Thr Asn Asp Met 

135
Leu Phe Pro Leu 
150 

Gly His Lys Gly 
165 

Val Ala Pro Glu 

Gln Val Ser Asn

200 

Val Phe Lys Ala 
215 

Phe 



Gly Val Thr Gln

Leu Ala Pro Leu 

60 

Pro Leu Ala Ala 
75 

Asn Thr Phe Gln
90 

Pro Met Ala Asp 
105 

Ala Gly Ala Asn 

Pro Ile Val Asp

140 

Gln Ile Asn Pro
155 

Glu Thr Arg Glu 
170 

Leu Thr Val Ile
185

Gly Gln Ala Val

Gly Glu Glu Ala 

220 



Thr Trp Asp Glu 
45 

Gly Val Asn Val 

Ile Glu Lys Ala

80 

Leu Leu Lys Glu 
95 

Arg Val Lys Arg 
110 

Leu Ala Cys Pro 
125 

Pro Asn Gly Phe 

His Phe Thr Asn 

160 

Gln Arg Ile Arg
175 

Gly Leu Pro Glu 
190 

Leu Gly Gly Pro 
205 

Val Ala Leu Glu 



<210> 20 

<211> 99

<212> PRT 

<213> Human immunodeficiency virus 



<400> 20 



Pro 


Gln


Ile Thr


Leu 


Trp 


Gln


Arg 


Pro 


Leu 


Val Thr 


Val 


Lys 


Ile


Gly 


1 






5 










10 








15 




Gly 


Gln


Leu Arg 


Glu 


Ala 


Leu 


Leu 


Asp 


Thr Gly Ala 


Asp 


Asp 


Thr 


Val 






20 










25 








30 






Leu 


Glu 


Asp Ile


Asn 


Leu 


Pro 


Gly 


Lys 


Trp 


Lys Pro 


Lys 


Met 


Ile


Gly 






35 








40 








45 








Gly 


Ile


Gly Gly 


Phe 


Ile


Lys 


Val 


Arg 


Gln


Tyr Asp 


Gln


Ile


Leu 


Ile




50 








55 








60 










Glu 


Ile


Cys Gly 


Lys 


Lys 


Ala 


Ile


Gly 


Thr 


Val Leu 


Val 


Gly 


Pro 


Thr 


65 








70 










75 








80 


Pro 


Val 


Asn Ile


Ile


Gly 


Arg 


Asn 


Met 


Leu 


Thr Gln


Ile


Gly 


Cys 


Thr 








85 










90 








95 




Leu 


Asn 


Phe 

























<210> 21 
<211> 297 
<212> PRT 

<213> Escherichia coli 
<400> 21 

Ser Thr Glu Thr Leu Ser Phe Thr 
1 5 

Ser Leu Gly Thr Leu Ser Gly Lys 

20 

Glu Glu Gly Gly Arg Lys Val Ser 

35 40 
Ala Ala Ile Ile Lys Gly Ala Ile



Pro Asp Asn Ile Asn Ala Asp Ile

10 15 

Thr Lys Glu Arg Val Tyr Leu Ala 

25 30 

Gln Leu Asp Trp Lys Phe Asn Asn

45 

Asn Trp Asp Leu Met Pro Gln Ile
50



cp r~ 


Ile

JU W 


Gly Ala 


o o 








Met 


Val 


AD U 


Gln


A<?T> 




Ala 


Arn 








100 


Acn 


J_l w 




Tip, 
J> JL C 






115 




Gly 


Leu 


Met 


Ala 




130 








^jxy 




Tyr 


145








Gly 


Ser


Phe


Pro 


Lys 


tut a 4- 


Pro 


Tyr 








180 


Glu 


Leu 


Gly 


Gly 






195 




Asn 


Asp 


Glu 


His 




210 






Val 


Lys 


Asp 


Gln


225 








Val 


Thr 


Pro 


Asn 


Thr 


Asn 


Lys 


Lys 








260 


Ser 


Asp 


Tyr 


Ser 






275 




Thr 


Thr 


Ala Gly 




290 











55 




ax a 


ijj-Ly 


Trp 


Thr 




70 






Asp 


Trp 


Met 


Asp 










alS 


Pro 


Asp 


Thr 


T i 


Gl 


Trp 


Leu 








120 


L»±y 


Tyr 


Gln


Glu 






135 




Ile


Tyr 


Ser 


Ser 




150 






Asn 


Gly 


Glu Arg 


165 








Ile


Gly 


Leu 


Thr 


Thr 


Phe 


Lys 


Tyr 








200 


Tyr 


Asp 


Pro 


Lys 






215 




Asn 


Tyr 


Tyr 


Ser 




230 






Ala 


Lys 


Val 


Tyr 


245 








Gly Asn 


Thr 


Ser 


Lys 


Asn 


Gly Ala 








280 


Leu 


Lys 


Tyr 


Thr 






295 





60 

Thr Leu Gly Ser 
75 

Ser Ser Asn Pro 
90 

Gln Leu Asn Tyr
105 

Leu Asn Glu Pro 

Ser Arg Tyr Ser 

140 

Glu Glu Gly Phe 
155 

Ala Ile Gly Tyr
170 

Gly Ser Tyr Arg 
185 

Ser Gly Trp Val 

Gly Arg Ile Thr

220 

Val Ala Val Asn 
235 

Val Glu Gly Ala
250 

Leu Tyr Asp His 
265 

Gly Ile Glu Asn
Phe 



Arg Gly Gly Asn 

80 

Gly Thr Trp Thr 
95 

Ala Asn Glu Phe 
110 

Asn Tyr Arg Leu 
125 

Phe Thr Ala Arg 

Arg Asp Asp Ile

160 

Lys Gln Arg Phe
175 

Tyr Glu Asp Phe 
190 

Glu Ser Ser Asp 
205 

Tyr Arg Ser Lys 

Ala Gly Tyr Tyr 

240 

Trp Asn Arg Val 
255 

Asn Asn Asn Thr 

270

Tyr Asn Phe Ile
285



<210> 22 

<211> 212

<212> PRT 

<213> Carica papaya 



<400> 22 

Ile Pro Glu Tyr

1 

Lys Asn Gln Gly

20 

Thr Ile Glu Gly
35

Ser Glu Gln Glu
50 

Gly Gly Tyr Pro 
65 

His Tyr Arg Asn 

Ser Arg Glu Lys 

100 

Val Gln Pro Tyr
115 

Pro Val Ser Val 
130 

Arg Gly Gly Ile
145 

Val Ala Ala Val 

Trp Gly Thr Gly 

180 



Val 


Asp 


Trp 


Arg 


5 








Ser 


Cys 


Gly Ser 


Ile


Ile


Lys 


Ile








40 


Leu 


Leu 


Asp 


Cys 






55 




Trp 


Ser 


Ala 


Leu 




70 






Thr 


Tyr 


Pro 


Tyr 


85 








Gly 


Pro 


Tyr Ala 


Asn 


Gln


Gly Ala 








120 


Val 


Leu 


Gln


Ala 






135




Phe 


Val 


Gly 


Pro 




150 






Gly 


Tyr 


Gly 


Pro 


165 








Trp 


Gly 


Glu 


Asn 



Gln


Lys 


Gly Ala 




10 






Cys 


Trp 


Ala 


Phe 


25 








Arg 


Thr 


Gly Asn 


Asp 


Arg 


Arg 


Ser 








60 


Gln


Leu 


Val 


Ala 






75 




Glu Gly 


Val 


Gln




90 






Ala 


Lys 


Thr 


Asp 


105 








Leu 


Leu 


Tyr 


Ser 


Ala Gly 


Lys 


Asp 








140 


Cys


Gly 


Asn 


Lys 






155 




Asn 


Tyr 


Ile


Leu 




170 






Gly Tyr 


Ile


Arg 


185 









Val Thr Pro Val 
15 

Ser Ala Val Val 
30 

Leu Asn Gln Tyr
45 

Tyr Gly Cys Asn 

Gln Tyr Gly Ile

80

Arg Tyr Cys Arg 
95 

Gly Val Arg Gln
110

Ile Ala Asn Gln
125

Phe Gln Leu Tyr
Val Asp His Ala 

160 

Ile Lys Asn Ser
175

Ile Lys Arg Gly
190
Thr Gly Asn Ser Tyr Gly Val Cys Gly Leu Tyr Thr Ser Ser Phe Tyr

195 200 205 

Pro Val Lys Asn 
210 



<210> 23 

<211> 699 

<212> PRT 

<213> Homo sapiens 



<400> 23 



Ala 


Gly 


Ile


Ala 


Ala 


Lys 


Leu 


Ala 


1 








5 








Leu 


Gly 


Ser 


His 


Glu 


Arg 


Ala 


Ile








20 










Ala 


Leu 


Arg 


Asn 


Glu 


Cys 


Leu 


Glu 






35 










40 


Ser 


Phe 


Pro 


Ala 


Ile


Pro 


Ser 


Ala 




50 










55 




Tyr 


Ser 


Ser 


Lys 


Thr 


Arg 


Gly Met 


65 










70 






Cys 


Ala 


Asp 


Pro 


Gln


Phe 


Ile


Ile










85 








Cys 


Gln


Gly 


Ala 


Leu 


Gly Asp 


Cys 








100 










Leu 


Thr 


Leu 


Asn 


Glu 


Glu 


Ile


Leu 






115 










120 


Ser 


Phe 


Gln


Glu 


Asn 


Tyr 


Ala 


Gly 




130 










135 




Tyr 


Gly 


Glu 


Trp 


Val 


Glu 


Val 


Val 


145 










150 






Asp


Gly 


Glu 


Leu 


Leu 


Phe 


Val 


His 










165 








Ser 


Ala 


Leu 


Leu 


Glu 


Lys 


Ala 


Tyr 








180 










Ala 


Leu 


Ser 


Gly 


Gly 


Ala 


Thr 


Thr 






195 










200 


Gly 


Ile


Ala 


Glu 


Trp 


Tyr 


Glu 


Leu 




210 










215 




Lys 


Ile


Ile


Gln


Lys 


Ala 


Leu 


Gln


225 










230 






Ile


Asp 


Ile


Thr 


Ser 


Ala 


Ala 


Asp 










245 








Leu 


Val 


Lys 


Gly 


His 


Ala 


Tyr 


Ser 








260 










Ser 


Asn 


Gly 


Ser 


Leu 


Gln


Lys 


Leu 






275 










280 


Glu 


Val 


Glu 


Trp 


Thr 


Gly Arg 


Trp 




290 










295 




Thr 


Ile


Asp 


Pro 


Glu 


Glu 


Arg 


Glu 


305 










310 






Gly 


Glu 


Phe 


Trp 


Met 


Ser 


Phe 


Ser 










325 








Leu 


Glu 


Ile


Cys 


Asn 


Leu 


Thr 


Pro 








340 










Lys 


Lys 


Trp 


Lys 


Leu 


Thr 


Lys 


Met 






355 










360 


Thr 


Ala 


Gly 


Gly 


Cys 


Arg 


Asn 


Tyr 




370 










375 




Gln


Tyr 


Leu 


Ile


Lys 


Leu 


Glu 


Glu 


385










390 






Glu 


Ser 


Gly 


Cys 


Thr 


Phe 


Leu 


Val 



Lys 


Asp 


Arg 


Glu 


Ala 


Ala 


Glu Gly 




10 










15 




Lys 


Tyr 


Leu 


Asn 


Gln


Asp 


Tyr 


Glu 


25 










30 






Ala 


Gly 


Thr 


Leu 


Phe 


Gln


Asp 


Pro 










45 








Leu 


Gly 


Phe 


Lys 


Glu 


Leu 


Gly 


Pro 








60 










Arg 


Trp 


Lys 


Arg 


Pro 


Thr 


Glu 


Ile






75 










80 


Gly 


Gly 


Ala 


Thr 


Arg 


Thr 


Asp 


Ile




90 










95 




Trp 


Leu 


Leu 


Ala 


Ala 


Ile


Ala 


Ser 


105 










110 






Ala 


Arg 


Val 


Val


Pro 


Leu 


Asn 


Gln










125 








Ile


Phe 


His 


Phe 


Gln


Phe 


Trp 


Gln








140 










Val 


Asp 


Asp 


Arg 


Leu 


Pro 


Thr 


Lys 






155 










160 


Ser 


Ala 


Glu 


Gly 


Ser 


Glu 


Phe 


Trp 




170 










175 




Ala 


Lys 


Ile


Asn 


Gly Cys 


Tyr 


Glu 


185 










190 






Glu 


Gly 


Phe 


Glu 


Asp 


Phe 


Thr 


Gly 










205 








Lys 


Lys 


Pro 


Pro 


Pro 


Asn 


Leu 


Phe 








220 










Lys 


Gly 


Ser 


Leu 


Leu 


Gly 


Cys 


Ser 






235 










240 


Ser 


Glu 


Ala 


Ile


Thr 


Phe 


Gln


Lys 




250 










255 




Val 


Thr 


Gly 


Ala 


Glu 


Glu 


Val 


Glu 


265 










270 






Ile


Arg 


Ile


Arg 


Asn 


Pro 


Trp 


Gly 










285 








Asn 


Asp 


Asn 


Cys 


Pro 


Ser 


Trp 


Asn 








300 










Arg 


Leu 


Thr 


Arg 


Arg 


His 


Glu 


Asp 






315 










320 


Asp 


Phe 


Leu 


Arg 


His 


Tyr 


Ser 


Arg 




330 










335 




Asp 


Thr 


Leu 


Thr 


Ser 


Asp 


Thr 


Tyr 


345 










350 






Asp 


Gly 


Asn 


Trp 


Arg 


Arg 


Gly 


Ser 










365 








Pro 


Asn 


Thr 


Phe 


Trp 


Met 


Asn 


Pro 








380 










Glu Asp 


Glu 


Asp 


Glu 


Glu 


Asp 


Gly 






395 










400 


Gly Leu 


Ile


Gln


Lys 


His 


Arg 


Arg 



Arg Gln Arg Lys

420 

Tyr Glu Val Pro 
435 

Lys Asn Phe Phe 
450 

Ile Asn Leu Arg
465

Tyr Ile Leu Val

Cys Ile Arg Val

500 

Asp Glu Ile Glu
515

Ile Asp Asp Gly
530

Ala Glu Ile Ser
545

Ala Lys Arg Gln

Lys Ile Met Val

580

Leu Lys Glu Phe 
595 

Ile Tyr Arg Glu
610 

Glu Met Arg Lys 
625 

Leu His Gln Val

Asp Phe Asp Asn 

660 

Lys Ile Phe Lys
675

Asp Leu Ile Ser
690 



405 

Met Gly Glu Asp 

Glu Glu Leu Ser 

440 

Leu Thr Asn Arg 
455 

Glu Val Leu Asn 
470 

Pro Ser Thr Phe 
485 

Phe Ser Glu Lys 

Ala Asn Leu Glu 

520 

Val Arg Arg Leu 
535 

Ala Phe Glu Leu 
550 

Asp Ile Lys Ser
565 

Asp Met Leu Asp 

Tyr Ile Leu Trp

600

Ile Asp Val Asp
615 

Ala Leu Glu Glu 
630 

Ile Val Ala Arg
645 

Phe Val Arg Cys 

Gln Leu Asp Pro

680 

Trp Leu Cys Phe 
695 



410 

Met His Thr Ile
425

Gly Gln Thr Asn

Ala Arg Glu Arg 

460 

Arg Phe Lys Leu 
475 

Glu Pro Asn Lys 
490 

Lys Ala Asp Tyr 
505 

Glu Phe Asp He 

Phe Ala Gln Leu

540 

Gln Thr Ile Leu
555

Asp Gly Phe Ser
570 

Ser Asp Gly Ser 
585 

Thr Lys Ile Gln

Arg Ser Gly Thr 

620 

Ala Gly Phe Lys 
635 

Phe Ala Asp Asp 
650 

Leu Val Arg Leu 
665 

Glu Asn Thr Gly 
Ser Val Leu 



415 

Gly Phe Gly Ile
430

Ile His Leu Ser
445 

Ser Asp Thr Phe 

Pro Pro Gly Glu 

480 

Asp Gly Asp Phe 
495 

Gln Ala Val Asp

510
Ser Glu Asp Asp 
525 

Ala Gly Glu Asp 

Arg Arg Val Leu 

560 

Ile Glu Thr Cys
575 

Gly Lys Leu Gly 
590 

Lys Tyr Gln Lys
605 

Met Asn Ser Tyr 

Met Pro Cys Gln

640 

Gln Leu Ile Ile
655 

Glu Thr Leu Phe 
670 

Thr Ile Glu Leu
685 



<210> 24 

<211> 221 

<212> PRT 

<213> Tobacco etch virus 



<400> 24




























Gly 


Glu 


Ser 


Leu 


Phe 


Lys 


Gly 


Pro 


Arg 


Asp 


Tyr 


Asn 


Pro 


Ile


Ser 


Ser 


1 








5 










10 










15 




Thr 


Ile


Cys 


His 


Leu 


Thr 


Asn 


Glu 


Ser 


Asp 


Gly His 


Thr 


Thr 


Ser 


Leu 








20 










25 










30 






Tyr 


Gly 


Ile
35 


Gly 


Phe 


Gly 


Pro 


Phe 
40 


Ile


Ile


Thr


Asn


Lys
45 


His 


Leu 


Phe 


Arg 


Arg 


Asn 


Asn 


Gly Thr 


Leu 


Leu 


Val 


Gln


Ser 


Leu 


His 


Gly 


Val 


Phe 




50 










55 










60 








Lys 


Val 


Lys 


Asn 


Thr 


Thr 


Thr 


Leu 


Gln


Gln


His


Leu


Ile


Asp 


Gly Arg 


65 










70 










75 








80 


Asp 


Met 


Ile


Ile


Ile
85 


Arg 


Met 


Pro 


Lys 


Asp 
90 


Phe 


Pro 


Pro 


Phe 


Pro 
95 


Gln


Lys 


Leu 


Lys 


Phe 

100 


Arg 


Glu 


Pro 


Gln


Arg 
105 


Glu 


Glu 


Arg 


Ile


Cys

110 


Leu 


Val 


Thr 


Thr 


Asn 


Phe 


Gln


Thr 


Lys 


Ser 


Met 


Ser 


Ser 


Met 


Val 


Ser 


Asp 


Thr 






115 










120 










125 






Ser 


Cys 
130 


Thr 


Phe 


Pro 


Ser 


Ser 
135 


Asp 


Gly 


Ile


Phe 


Trp 
140 


Lys 


His 


Trp 


Ile
Gln Thr Lys Asp
145 

Gly Phe Ile Val

Asn Tyr Phe Thr

180

Gln Glu Ala Gln
195

Val Leu Trp Gly
210 



Gly Gln Cys Gly
150

Gly Ile His Ser
165

Ser Val Pro Lys

Gln Trp Val Ser

200 

Gly His Lys Val 
215 



Ser Pro Leu Val 
155 

Ala Ser Asn Phe 
170 

Asn Phe Met Glu 
185 

Gly Trp Arg Leu 

Phe Met Asp Lys 

220 



Ser Thr Arg Asp 

160 

Thr Asn Thr Asn 
175 

Leu Leu Thr Asn 
190 

Asn Ala Asp Ser 

205 

Pro 



<210> 25 

<211> 371 

<212> PRT 

<213> Streptococcus pyogenes 



<400> 25




























Asp 


Gln


Asn 


Phe 


Ala 


Arg 


Asn 


Glu 


Lys 


Glu 


Ala 


Lys 


Asp 


Ser 


Ala 


Ile


1 








5 










10 










15 




Thr 


Phe 


Ile


Gln


Lys 


Ser 


Ala 


Ala 


Ile


Lys 


Ala 


Gly Ala 


Arg 


Ser 


Ala 








20 










25 










30 






Glu 


Asp 


Ile


Lys 


Leu 


Asp 


Lys 


Val 


Asn 


Leu 


Gly 


Gly Glu 


Leu 


Ser 


Gly 






35 










40 










45 








Ser 


Asn 


Met 


Tyr 


Val 


Tyr 


Asn 


Ile


Ser 


Thr 


Gly Gly Phe 


Val 


Ile


Val 




50 










55 










60 










Ser Gly Asp 


Lys 


Arg 


Ser 


Pro 


Glu 


Ile


Leu 


Gly 


Tyr 


Ser 


Thr 


Ser 


Gly 


65 










70 










75 










80 


Ser 


Phe 


Asp 


Val 


Asn 
85 


Gly 


Lys 


Glu 


Asn 


Ile
90 


Ala 


Ser 


Phe 


Met 


Glu 

95 


Ser 


Tyr 


Val 


Glu 


Gln
100


Ile


Lys 


Glu 


Asn 


Lys 
105 


Lys 


Leu 


Asp 


Ser 


Thr 
110 


Tyr 


Ala 


Gly 


Thr 


Ala 
115 


Glu 


Ile


Lys


Gln


Pro 
120 


Val 


Val 


Lys 


Ser 


Leu 
125 


Leu 


Asp 


Ser 


Lys 


Gly 


Ile


His 


Tyr 


Asn 


Gln


Gly Asn 


Pro 


Tyr 


Asn 


Leu 


Leu 


Thr 


Pro 




130 










135 










140 










Val 


Ile


Glu 


Lys 


Val 


Lys 


Pro 


Gly 


Glu 


Gln


Ser 


Phe 


Val 


Gly 


Gln


His 


145 










150 










155 










160 


Ala 


Ala 


Thr 


Gly 


Ser 
165 


Val


Ala 


Thr 


Ala 


Thr 
170 


Ala 


Gln


Ile


Met 


Lys 
175 


Tyr 


His 


Asn 


Tyr 


Pro 


Asn 


Lys 


Gly 


Leu 


Lys 


Asp 


Tyr 


Thr 


Tyr Thr


Leu 


Ser 








180 










185 










190 






Ser 


Asn 


Asn 
195 


Pro 


Tyr 


Phe 


Asn 


His 
200 


Pro 


Lys 


Asn 


Leu 


Phe 
205 


Ala 


Ala 


Ile


Ser 


Thr 


Arg 


Gln


Tyr 


Asn 


Trp 


Asn 


Asn 


Ile


Leu 


Pro 


Thr 


Tyr 


Ser 


Gly 




210 










215 










220 








Arg 


Glu 


Ser 


Asn 


Val 


Gln


Lys 


Met 


Ala 


Ile


Ser 


Glu 


Leu 


Met 


Ala 


Asp 


225 










230 










235 










240 


Val 


Gly 


Ile


Ser 


Val 
245 


Asp 


Met 


Asp 


Tyr 


Gly 
250 


Pro 


Ser 


Ser 


Gly 


Ser 
255 


Ala 


Gly 


Ser 


Ser 


Arg 


Val 


Gln


Arg Ala 


Leu 


Lys 


Glu 


Asn 


Phe 


Gly 


Tyr 


Asn 








260 










265 










270 






Gln


Ser 


Val 


His 


Gln


Ile


Asn Arg 


Gly Asp 


Phe 


Ser 


Lys 


Gln


Asp 


Trp 






275 










280 










285 








Glu 


Ala 
290 


Gln


Ile


Asp 


Lys 


Glu 
295 


Leu 


Ser 


Gln


Asn


Gln
300 


Pro 


Val 


Tyr 


Tyr 


Gln


Gly Val 


Gly 


Lys 


Val 


Gly Gly His Ala 


Phe 


Val 


Ile


Asp 


Gly 


Ala 


305 










310 










315 










320 


Asp 


Gly Arg 


Asn 


Phe 


Tyr 


His 


Val 


Asn 


Trp 


Gly Trp 


Gly 


Gly Val 


Ser 










325 










330 










335 




Asp 


Gly 


Phe 


Phe 


Arg 


Leu 


Asp Ala 


Leu 


Asn 


Pro 


Ser 


Ala 


Leu 


Gly 


Thr 








340 










345 










350 






Gly Gly 


Gly 


Ala 


Gly 


Gly 


Phe 


Asn 


Gly Tyr 


Gln


Ser 


Ala 


Val 


Val 


Gly 






355 










360 










365 
Ile Lys Pro
370 



<210> 26 

<211> 353 

<212> PRT 

<213> Homo sapiens 



<400> 26 

Lys Lys His Thr 

1 

Tyr Met Asn Ser 

20 

Lys Ala Val Tyr 
35 

Val Pro Leu Ala 
50 

Lys Pro Val Gly 

65 

Leu Asp Ser Phe 

Leu Asp Asn Val 

100 

Ile Pro Lys Leu
115

Glu Val Asp Tyr
130

Leu Ser Ile Lys
145

Val Ala Val Glu

His Gly Leu Gln

180

Pro Val Leu His
195

Asp Gln Asn Ile
210

Pro Leu Asp Glu
225

Tyr Ile Leu His

His Tyr Val Val 

260 

Phe Asp Asp Asp 
275 

His Asn Tyr Gly 
290 

Asn Ala Tyr Met 
305 

Leu Gln Ala Val

Leu Gln Glu Glu

340 

Glu 



Gly 


Tyr 


Val 


Gly 


5 








Leu 


Leu 


Gln


Thr 


Met 


Met 


Pro 


Thr 








40 


Leu 


Gln


Arg Val 






55 




Thr 


Lys 


Lys 


Leu 




70 






Met 


Gln


His 


Asp 


85 








Glu 


Asn 


Lys 


Met 


Phe 


Arg 


Gly Lys 








120 


Arg 


Ser 


Asp Arg 






135 




Gly 


Lys 


Lys 


Asn 




150 






Gln


Leu 


Asp Gly 


165 








Glu 


Ala 


Glu 


Lys 


Leu 


Gln


Leu 


Met 








200 


Lys 


Ile


Asn 


Asp 






215 




Phe 


Leu 


Gln


Lys 




230 






Ala 


Val 


Leu 


Val 


245 








Tyr 


Leu 


Asn 


Pro 


Val 


Val 


Ser 


Arg 








280 


Gly 


His 


Asp 


Asp 






295 




Leu 


Val 


Tyr


Ile




310 






Thr 


Asp 


His 


Asp 


325 








Lys 


Arg 


Ile


Glu 



Leu 


Lys 


Asn 


Gln




10 






Leu 


Phe 


Phe 


Thr 


25 








Glu 


Gly 


Asp 


Asp 


Phe 


Tyr 


Glu 


Leu 








60 


Thr 


Lys 


Ser 


Phe 






75 




Val 


Gln


Glu 


Leu 




90 






Lys 


Gly 


Thr 


Cys 


105 








Met 


Val 


Ser 


Tyr 


Arg 


Glu 


Asp 


Tyr 








140 


Ile


Phe 


Glu 


Ser 






155 




Asp 


Asn 


Lys 


Tyr 




170 






Gly 


Val 


Lys 


Phe 


185 








Arg 


Phe 


Met 


Tyr 


Arg 


Phe 


Glu 


Phe 








220 


Thr 


Asp 


Pro 


Lys 






235 




His 


Ser 


Gly 


Asp 




250 






Lys 


Gly 


Asp 


Gly 


265 








Cys 


Thr 


Lys 


Glu 


Asp 


Leu 


Ser 


Val 








300 


Arg 


Glu 


Ser 


Lys 






315 




Ile


Pro


Gln


Gln




330 






Ala 


Gln


Lys 


Arg 



345 



Gly Ala Thr Cys 
15 

Asn Gln Leu Arg
30

Ser Ser Lys Ser
45

Gln His Ser Asp

Gly Trp Glu Thr 

80 

Cys Arg Val Leu 
95 

Val Glu Gly Thr 
110 

Ile Gln Cys Lys
125

Tyr Asp Ile Gln

Phe Val Asp Tyr 

160 

Asp Ala Gly Glu 
175 

Leu Thr Leu Pro 
190 

Asp Pro Gln Thr
205

Pro Glu Gln Leu

Asp Pro Ala Asn 

240 

Asn His Gly Gly 
255 

Lys Trp Cys Lys 
270 

Glu Ala Ile Glu
285 

Arg His Cys Thr 

Leu Ser Glu Val 

320 

Leu Val Glu Arg 

335

Lys Glu Arg Gln
350 



<210> 27 

<211> 174 

<212> PRT 

<213> Staphylococcus aureus 
<400> 27

Tyr Asn Glu Gln

1

Thr Gln Gly Asn

20 

Leu Asn Ala Thr 
35 

Arg Phe Leu His 
50 

Leu Thr Pro Arg 
65 

Pro Gln Leu Leu

Thr Lys Asn Asn 

100 

Arg Asn Gly Met 
115 

Lys Leu Asn Asn 
130 

Asn Gly Phe Met 
145 

Asn Gly Asp His 



Tyr Val Asn Lys 
5 

Asn Gly Trp Cys 

Tyr Asn Thr Asn 

40 

Pro Asn Leu Gln

55

Glu Met Ile Tyr
70 

Asn Arg Met Thr 
85 

Lys Gly Ile Ala

His Ala Gly His

120

Gly Gln Glu Val
135

Thr Gln Asp Ala
150

Tyr Gln Trp Tyr
165 



Leu Glu Asn Phe 
10 

Ala Gly Tyr Thr 
25 

Lys Tyr His Ala 

Gly Gln Gln Phe

60

Phe Gly Gln Thr
75 

Thr Tyr Asn Glu 
90 

Ile Leu Gly Ser
105

Ala Met Ala Val

Ile Ile Ile Trp

140

Lys Asn Asn Val
155

Ser Ser Ile Tyr
170 



Lys Ile Arg Glu
15 

Met Ser Ala Leu 
30 

Glu Ala Val Met 
45 

Gln Phe Thr Gly

Gln Gly Arg Ser

80 

Val Asp Asn Leu 
95 

Arg Val Glu Ser 
110 

Val Gly Asn Ala 
125 

Asn Pro Trp Asp 

Ile Pro Val Ser

160 

Gly Tyr 



<210> 28 

<211> 221 

<212> PRT 

<213> Saccharomyces cerevisiae 



<400> 28 

Gly Ser Leu Val 

1 

Lys Ala Leu Ala 

20 

Ile Glu Ile Thr
35 

Leu Asn Asp Thr 
50 

Thr Pro Asn Thr 
65 

Glu Arg Gly Tyr 

Gln Ile Asp Lys

100 

Ser His Trp Ala 
115 

Tyr Val Asp Ser 
130 

Leu Thr Asp Leu 
145 

Gly Glu Asp Phe 

Gly Tyr Asp Cys 

180 

Ala Asp Ala Pro 
195 

Arg Phe Ile Ala
210 



Pro 


Glu 


Leu 


Asn 


5 








Ser 


Arg 


Glu 


Asn 


Val 


Arg 


Asp 


Phe 








40 


Ile


Ile


Glu 


Phe 






55 




Val 


Ala 


Phe 


Asn 




70 






Gln


Gly 


Val 


Arg 


85 








Leu 


Asp 


Lys 


Ile


Leu 


Gly 


Ile


Ile








120 


Leu 


Ser 


Asn 


Gly 






135 




Gln


Lys 


Tyr 


Val 




150 






Asp 


Leu 


Ile


His 


165 








Gly 


Ile


Tyr


Val


Leu 


Asp 


Phe 


Asp 








200 


His 


Leu 


Ile


Leu 






215 





Glu Lys Asp Asp 
10 

Thr Gln Leu Met
25 

Lys Thr Leu Ala 

Phe Met Lys Tyr 

60 

Ser Phe Phe Tyr 
75 

Arg Trp Met Lys 
90 

Phe Thr Pro Ile
105 

Asp Leu Lys Lys 

Pro Asn Ala Met 

140 

Met Glu Glu Ser 
155 

Leu Asp Cys Pro 
170 

Cys Met Asn Thr 
185 

Tyr Lys Asp Ala 

Thr Asp Ala Leu 

220 



Asp Gln Val Gln
15 

Asn Arg Asp Asn 
30 

Pro Arg Arg Trp 
45 

Ile Glu Lys Ser

Thr Asn Leu Ser 

80 

Arg Lys Lys Thr 
95 

Asn Leu Asn Gln
110

Lys Thr Ile Gly
125

Ser Phe Ala Ile

Lys His Thr Ile

160

Gln Gln Pro Asn
175

Leu Tyr Gly Ser
190

Ile Arg Met Arg

205 

Lys 



<210> 29 
<211> 166 
<212> PRT

<213> Pyrococcus horikoshii 



<400> 29 



Met 


Lys


Val


Leu


Phe


Leu


Thr


Ala


Asn


Glu


Phe


Glu 


Asp Val 


Glu 


Leu 


1


















10








15 




Ile


Tyr 


Pro 


Tyr 
20 


His


Arg 


Leu 


Lys 


Glu 

25 


Glu


Gly


His


Glu Val 
30 


Tyr 


Ile


Ala 


Ser 


Phe 


Glu 


Arg 


Gly 


Thr 


Ile


Thr 


Gly 


Lys 


His 


Gly Tyr Ser Val 






35 










40 










45 






Lys


Val 
50 


Asp 


Leu 


Thr 


Phe 


Asp 
55 


Lys 


Val 


Asn 


Pro 


Glu 
60 


Glu Phe 


Asp 


Ala 


Leu 


Val 


Leu 


Pro 


Gly 


Gly 


Arg 


Ala 


Pro 


Glu 


Arg 


Val 


Arg Leu 


Asn 


Glu 


65 










70 










75 








80 


Lys 


Ala 


Val 


Ser 


Ile
85 


Ala 


Arg 


Lys 


Met 


Phe 
90 


Ser 


Glu 


Gly Lys 


Pro 
95 


Val 


Ala 


Ser 


Ile


Cys 


His 


Gly 


Pro 


Gln


Ile


Leu


Ile


Ser 


Ala Gly Val


Leu 








100 










105 








110 






Arg 


Gly 


Arg 
115 


Lys 


Gly 


Thr 


Ser 


Tyr 
120 


Pro 


Gly 


Ile


Lys 


Asp Asp 
125 


Met 


Ile


Asn 


Ala 


Gly Val 


Glu 


Trp 


Val 


Asp Ala 


Glu 


Val 


Val 


Val Asp 


Gly Asn 




130 










135 










140 








Trp 


Val 


Ser 


Ser 


Arg 


Val 


Pro 


Ala 


Asp 


Leu 


Tyr 


Ala 


Trp Met 


Arg 


Glu 


145 










150 










155 








160 


Phe 


Val


Lys 


Leu 


Leu 
165 


Lys 





















<210> 30 

<211> 316 

<212> PRT 

<213> Bacillus thermoproteolyticus 



<400> 30 



Ile


Thr 


Gly 


Thr 


Ser 


Thr 


Val 


Gly 


Val 


Gly 


Arg 


Gly 


Val 


Leu 


Gly Asp 


1 








5 










10 










15 


Gln


Lys 


Asn 


Ile


Asn 


Thr 


Thr 


Tyr 


Ser 


Thr 


Tyr 


Tyr 


Tyr 


Leu 


Gln Asp








20 










25 










30 




Asn


Thr 


Arg 


Gly 


Asp 


Gly 


Ile


Phe 


Thr 


Tyr 


Asp 


Ala 


Lys 


Tyr 


Arg Thr 






35 










40 










45 






Thr 


Leu 


Pro 


Gly 


Ser 


Leu 


Trp 


Ala 


Asp 


Ala 


Asp 


Asn 


Gln


Phe 


Phe Ala 




50 










55 










60 








Ser 


Tyr 


Asp 


Ala 


Pro 


Ala 


Val 


Asp 


Ala 


His 


Tyr 


Tyr 


Ala 


Gly 


Val Thr 


65 










70 










75 








80 


Tyr 


Asp 


Tyr 


Tyr 


Lys 


Asn 


Val 


His 


Asn 


Arg 


Leu 


Ser 


Tyr 


Asp 


Gly Asn 










85 










90 










95 


Asn 


Ala 


Ala 


Ile


Arg 


Ser 


Ser 


Val 


His 


Tyr 


Ser 


Gln


Gly 


Tyr 


Asn Asn 








100 










105 










110 




Ala 


Phe 


Trp 


Asn 


Gly 


Ser 


Glu 


Met 


Val 


Tyr 


Gly 


Asp 


Gly Asp Gly Gln






115 










120 










125 






Thr 


Phe 


Ile


Pro 


Leu 


Ser 


Gly 


Gly 


Ile


Asp 


Val 


Val 


Ala 


His 


Glu Leu 




130 










135 










140 








Thr 


His 


Ala 


Val 


Thr 


Asp 


Tyr 


Thr 


Ala 


Gly 


Leu 


Ile


Tyr


Gln


Asn Glu 


145 










150 










155 








160 


Ser 


Gly 


Ala 


Ile


Asn 


Glu 


Ala 


Ile


Ser 


Asp 


Ile


Phe 


Gly Thr 


Leu Val 










165 










170 










175 


Glu 


Phe 


Tyr 


Ala 


Asn 


Lys 


Asn 


Pro 


Asp 


Trp 


Glu 


Ile


Gly Glu Asp Val








180 










185 










190 




Tyr 


Thr 


Pro 


Gly 


Ile


Ser 


Gly 


Asp 


Ser 


Leu 


Arg 


Ser 


Met 


Ser 


Asp Pro 






195 










200 










205 






Ala 


Lys 


Tyr 


Gly 


Asp 


Pro 


Asp His


Tyr 


Ser 


Lys 


Arg 


Tyr 


Thr 


Gly Thr 




210 










215 










220 








Gln


Asp 


Asn 


Gly 


Gly 


Val 


His 


Ile


Asn 


Ser 


Gly 


Ile


Ile


Asn 


Lys Ala 
225

Ala Tyr Leu Ile

Gly Ile Gly Arg

260

Gln Tyr Leu Thr
275

Val Gln Ser Ala
290

Ser Val Lys Gln
305 



230 



Ser 


Gln


Gly Gly 


245 






Asp 


Lys 


Leu Gly 


Pro 


Thr 


Ser Asn 






280 


Thr 


Asp 


Leu Tyr 






295 


Ala 


Phe 


Asp Ala 




310 





235 

Thr His Tyr Gly 
250 

Lys Ile Phe Tyr
265

Phe Ser Gln Leu

Gly Ser Thr Ser 

300 

Val Gly Val Lys 
315 



240 

Val Ser Val Val 
255 

Arg Ala Leu Thr 
270 

Arg Ala Ala Ala 
285 

Gln Glu Val Ala



<210> 31 

<211> 169 

<212> PRT 

<213> Homo sapiens 



<400> 31 

Val Leu Thr Glu 

1 

Arg Ile Glu Asn

20

Ala Ile Glu Lys
35 

Phe Thr Lys Val 

50
Arg Gly Asp His 
65 

Leu Ala His Ala 

Phe Asp Glu Asp 

100 

His Arg Val Ala 
115 

Ser Thr Asp Ile
130

Asp Val Gln Leu
145

Gly Arg Ser Gln



Gly Asn Pro Arg 

5 

Tyr Thr Pro Asp 

Ala Phe Gln Leu

40

Ser Glu Gly Gln
55 

Arg Asp Asn Ser 
70 

Phe Gln Pro Gly
85 

Glu Arg Trp Thr 

Ala His Glu Leu 

120 

Gly Ala Leu Met 
135 

Ala Gln Asp Asp
150

Asn Pro Val Gln
165 



Trp Glu Gln Thr
10 

Leu Pro Arg Ala 
25 

Trp Ser Asn Val 

Ala Asp Ile Met

60 

Pro Phe Asp Gly 
75 

Pro Gly Ile Gly
90 

Asn Asn Phe Arg 
105 

Gly His Ser Leu 

Tyr Pro Ser Tyr 

140 

Ile Asp Gly Ile

155 

Pro 



His 


Leu 


Thr 


Tyr 






15 




Asp 


Val 


Asp 


His 




30 






Thr 


Pro 


Leu 


Thr 


45 








Ile


Ser 


Phe 


Val 


Pro 


Gly 


Gly Asn 








80 


Gly Asp 


Ala 


His 






95 




Glu 


Tyr 


Asn 


Leu 




110 






Gly Leu 


Ser 


His 


125 








Thr 


Phe 


Ser 


Gly 


Gln


Ala


Ile


Tyr 








160 



<210> 32 

<211> 496 

<212> PRT 

<213> Homo sapiens 



<400> 32 



Gln


Tyr 


Ser 


Pro 


1 








Phe 


Glu 


Trp 


Arg 








20 


Ala 


Pro 


Lys 


Gly 






35 




Val 


Ala 


Ile


Tyr 




50 






Val 


Ser 


Tyr 


Lys 


65 








Asn 


Met 


Val 


Thr 


Ala 


Val 


Ile


Asn 



Asn Thr Gln Gln
5

Trp Val Asp Ile

Phe Gly Gly Val 

40 

Asn Pro Phe Arg 
55 

Leu Cys Thr Arg 

70 

Arg Cys Asn Asn 
85 

His Met Cys Gly 



Gly Arg Thr Ser 
10 

Ala Leu Glu Cys 
25 

Gln Val Ser Pro

Pro Trp Trp Glu 

60 

Ser Gly Asn Glu 
75 

Val Gly Val Arg 
90 

Asn Ala Val Ser 
105 



Ile


Val 


His 


Leu 






15 




Glu 


Arg 


Tyr 


Leu 




30 






Pro 


Asn 


Glu 


Asn 


45 








Arg 


Tyr 


Gln


Pro 


Asp 


Glu 


Phe 


Arg 








80 


Ile


Tyr 


Val 


Asp 






95 




Ala 


Gly 


Thr 


Ser 




110 
Ser


Thr 


Cys 


Gly Ser 


Tyr 


Phe 


Asn


Pro 


Gly Ser Arg 


Asp 


Phe 


Pro 


Ala 






115 










120










125 








Val 


Pro 


Tyr 


Ser 


Gly 


Trp Asp 


Phe


Asn 


Asp 


Gly 


Lys 


Cys 


Lys 


Thr 


Gly 




130 










135 










140










Ser Gly Asp 


Ile


Glu


Asn


Tyr


Asn


Asp Ala


Thr


Gln


Val 


Arg 


Asp 


Cys 


145










150 










155 










160 


Arg


Leu 


Thr 


Gly Leu 


Leu Asp 




Ala 


Leu 


Glu 


Lys 


Asp Tyr Val Arg 










165 










170 










175 




Ser


Lys


Ile


Ala 


Glu 


Tyr 


Met 




His 


Leu 


Ile


Asp


Ile


Gly 


Val 


Ala 








180 










185 










190 






Gly 


Phe 


Arg 


Leu 


Asp 


Ala 


Ser 


Lys 


His 


Met 


Trp 


Pro 


Gly Asp 


Ile


Lys 






195 










200










205 








Ala 


Ile


Leu 


Asp 


Lys 


Leu 


His 


Asn 


Leu 


Asn 


Ser 


Asn 


Trp 


Phe 


Pro 


Ala 




210 










215 










220 










Gly


Ser 


Lys 


Pro 


Phe 


Ile


Tyr


Gln


Glu


Val


Ile


Asp 


Leu 


Gly 


Gly 


Glu 


225 










230 










235 










240 


Pro 


Ile


Lys 


Ser 


Ser 


Asp 


Tyr 


Phe


Gly Asn 


Gly Arg 


Val 


Thr 


Glu 


Phe 










245 










250 










255 




Lys 


Tyr 


Gly 


Ala 


Lys 


Leu 


Gly 


Thr 


Val 


Ile


Arg 


Lys 


Trp 


Asn 


Gly 


Glu 








260 










265 










270 






Lys 


Met 


Ser 


Tyr 


Leu 


Lys 


Asn 


Trp 


Gly Glu 


Gly 


Trp 


Gly 


Phe 


Val 


Pro 






275 










280 










285 








Ser 


Asp Arg 


Ala 


Leu 


Val 


Phe 


Val 


Asp 


Asn 


His 


Asp 


Asn 


Gln


Arg 


Gly 




290 










295 










300 







His 


Gly Ala 


Gly Gly 


Ala 


Ser 


Ile


Leu 


Thr 


Phe 


Trp 


Asp 


Ala 


Arg 


Leu 


305 










310 










315 








320 


Tyr 


Lys 


Met 


Ala 


Val 


Gly 


Phe 


Met 


Leu 


Ala 


His 


Pro 


Tyr 


Gly 


Phe 


Thr 










325 










330 










335 




Arg 


Val 


Met 


Ser 


Ser 


Tyr 


Arg 


Trp 


Pro 


Arg 


Gln


Phe


Gln


Asn 


Gly Asn 








340 










345 
















Asp 


Val 


Asn 


Asp 


Trp 


Val 


Gly 


Pro 


Pro 


Asn 


Asn 


Asn 


Gly 


Val 


Ile


Lys 




355 








360 








365 






Glu 


Val 


Thr 


Ile


Asn 


Pro 


Asp 


Thr 


Thr 


Cys 


Gly Asn 


Asp 


Trp 


Val 


Cys 




370 










375 










380 










Glu 


His 


Arg 


Trp 


Arg 


Gln


Ile


Arg 


Asn 


Met 


Val 


Ile


Phe 


Arg 


Asn 


Val 


385 










390 










395 










400 


Val 


Asp 


Gly 


Gln


Pro 


Phe 


Thr 


Asn 


Trp 


Tyr 


Asp 


Asn 


Gly 


Ser 


Asn 


Gln










405 










410 










415 




Val 


Ala 


Phe 


Gly Arg 


Gly 


Asn 


Arg 


Gly 


Phe 


Ile


Val 


Phe 


Asn 


Asn 


Asp 








420 










425 










430 






Asp 


Trp 


Ser 


Phe 


Ser 


Leu 


Thr 


Leu 


Gln


Thr 


Gly Leu 


Pro 


Ala 


Gly 


Thr 






435 










440 










445 






Tyr 


Cys 


Asp 


Val 


Ile


Ser 


Gly 


Asp 


Lys 


Ile


Asn 


Gly 


Asn 


Cys 


Thr 


Gly 




450 










455 










460 










Ile


Lys 


Ile


Tyr 


Val 


Ser 


Asp 


Asp Gly Lys 


Ala 


His 


Phe 


Ser 


Ile


Ser 


465 










470 










475 










480 


Asn 


Ser 


Ala 


Glu 


Asp 


Pro 


Phe 


Ile


Ala


Ile


His 


Ala 


Glu 


Ser 


Lys 


Leu 










485 










490 










495 




<210> 33

<211> 370

<212> PRT

<213> Trichoderma reesei



<400> 33




























Gln


Pro 


Gly 


Thr 


Ser 


Thr 


Pro 


Glu 


Val 


His 


Pro 


Lys 


Leu 


Thr 


Thr 


Tyr 


1 








5 










10 










15 


Lys 


Cys 


Thr 


Lys 


Ser 


Gly 


Gly 


Cys 


Val 


Ala 


Gln


Asp 


Thr 


Ser 


Val 


Val 








20 










25 










30 






Leu 


Asp 


Trp 


Asn 


Tyr Arg 


Trp 


Met 


His 


Asp 


Ala 


Asn 


Tyr 


Asn 


Ser 


Cys 






35 










40 










45 






Thr 


Val 


Asn 


Gly 


Gly 


Val 


Asn 


Thr 


Thr 


Leu 


Cys 


Pro 


Asp 


Glu 


Ala 


Thr 
50










55 










60 












Gly Lys Asn 


Cys 


Phe 


Ile


Glu


Gly


Val


Asp


Tyr


Ala


Ala


Ser


Gly


65 










70 










75










80


Val


Thr 


Thr 


Ser 


Gly 


Ser 


Ser 


Tip*! 1 


-L 11 J. 




Asn


Gln


Tyr


Met


Pro


Ser










85 










90














J- 


Ser 


Gly 


Gly 


Tyr 


Ser 


Ser 


Val


Ser


Pro


Arg


Leu


Tyr


Leu


Leu


Asp








100 










105 










110






Ser 


Asp 


Gly 


Glu 


Tyr 


Val 


Met 


Leu 


Lys


Leu


Asn


Gly Gln




Leu


Ser






115 










120 










125








Phe 


Asp 


Val 


Asp Leu 


Ser 


Ala 




DrA 

XT JL W 




Gly


Glu


Asn


Gly


Ser






130 










135 










140










Tyr


Leu


Ser


Gln


Met


Asp


Glu


Asn


Gly




Ala


Asn


Gln


Tyr 


Asn 


Thr 


145










150










155










160




Gly Ala Asn Tyr 


Gly 


Ser 


Gly


Tyr 


Lys 


Asp 


Ala 


Gln


Cys 


Pro 


Val 










165 










170










175




Gln


Thr 


Trp Arg 


Asn 


Gly Thr 


Leu 


Asn 


Thr 


Ser 


His 


Gln


Gly 


Phe 


Cys 








180 










185










190






Cys


Asn


Glu


Met


Asp


Ile


Leu


Glu


Gly


Asn


Ser


Arg


Ala


Asn


Ala


Leu 






195 










200










205 








Thr


Pro 


His 


Ser 


Cys 


Thr 


Ala 


Thr


Ala


Cys


Asp 


Ser 


Ala 


Gly 


Cys 


Gly 




210 










215 










220 










Phe


Asn 


Pro 


Tyr Gly 


Ser 


Gly 


Tyr 


Lys 


Ser 


Tyr 


Tyr 


Gly 



Pro 


Gly 


Asp 


225










230 










235 










240


Thr


Val 


Asp 


Thr 


Ser 


Lys 


Thr 


Phe 


Thr


Ile


Ile


Thr


Gln


Phe 


Asn 


Thr 










245 










250










255 




Asp


Asn Gly 


Ser 


Pro 


Ser Gly 


Asn 


Leu 


Val 


Ser 


Ile


Thr 


Arg 


Lys 


Tyr 








260 










265 










270 






Gln


Gln


Asn 


Gly Val 


Asp 


Ile


Pro 


Ser 


Ala 


Gln


Pro 


Gly 


Gly 


Asp 


Thr 






275 










280 










285 








Ile


Ser 


Ser 


Cys 


Pro 


Ser 


Ala 


Ser 


Ala 


Tyr 


Gly 


Gly 


Leu 


Ala 


Thr 


Met 




290 










295 










300 










Gly 


Lys 


Ala 


Leu 


Ser 


Ser 


Gly 


Met 


Val 


Leu 


Val 


Phe 


Ser 


Ile


Trp 


Asn 


305 










310 










315 










320 


Asp 


Asn 


Ser 


Gln


Tyr 


Met 


Asn 


Trp 


Leu 


Asp 


Ser 


Gly Asn 


Ala 


Gly 


Pro 










325 










330 










335 




Cys 


Ser 


Ser 


Thr 


Glu 


Gly Asn 


Pro 


Ser 


Asn 


Ile


Leu 


Ala 


Asn 


Asn 


Pro 








340 










345
















Asn 


Thr 


His 


Val 


Val


Phe


Ser


Asn


Ile


Arg


Trp


Gly Asp


Ile


Gly


Ser






355 




















365 








Thr 


Thr 
































370 






























<210> 34

<211> 223

<212> PRT

<213> Aspergillus niger



<400> 34




























Gln


Thr 


Met 


Cys 


Ser 


Gln


Tyr 


Asp 


Ser 


Ala 


Ser 


Ser 


Pro 


Pro 


Tyr 


Ser 


1 








5 










10 










15 




Val 


Asn 


Gln


Asn 


Leu 


Trp 


Gly 


Glu 


Tyr 


Gln


Gly 


Thr 


Gly 


Ser 


Gln


Cys 








20 










25 










30 






Val 


Tyr 


Val 


Asp 


Lys 


Leu 


Ser 


Ser 


Ser 


Gly 


Ala 


Ser 


Trp 


His 


Thr 


Glu 






35 










40 










45 








Trp 


Thr 


Trp 


Ser 


Gly 


Gly 


Glu 


Gly 


Thr 


Val 


Lys 


Ser 


Tyr 


Ser 


Asn 


Ser 




50 










55 










60 










Gly Val 


Thr 


Phe 


Asn 


Lys 


Lys 


Leu 


Val 


Ser 


Asp 


Val 


Ser 


Ser 


Ile


Pro 


65 










70 










75 










80 


Thr 


Ser 


Val 


Glu 


Trp 


Lys 


Gln


Asp 


Asn 


Thr 


Asn 


Val 


Asn 


Ala 


Asp 


Val 










85 










90 










95 




Ala 


Tyr 


Asp 


Leu 


Phe 


Thr 


Ala 


Ala 


Asn 


Val 


Asp 


His 


Ala 


Thr 


Ser 


Ser 








100 










105 










110 
Gly Asp Tyr Glu
115

Pro Ile Gly Lys
130 

Glu Val Trp Tyr 
145 

Tyr Ser Phe Val 

Asn Ala Phe Phe 

180 

Ser Gln Tyr Leu
195 

Gly Pro Ala Thr 
210 



Leu Met Ile Trp

120

Gln Ile Ala Thr
135 

Gly Ser Thr Thr 
150 

Ser Glu Ser Pro 
165 

Ser Tyr Leu Thr 

Ile Asn Leu Gln

200 

Phe Thr Val Asp 
215 



Leu Ala Arg Tyr 

Ala Thr Val Gly 

140 

Gln Ala Gly Ala
155

Ile Asn Ser Tyr
170

Gln Asn Gln Gly
185 

Phe Gly Thr Glu 

Asn Trp Thr Ala 

220 



Gly Asn Ile Gln
125 

Gly Lys Ser Trp 

Glu Gln Arg Thr

160

Ser Gly Asp Ile
175 

Phe Pro Ala Ser 
190 

Ala Phe Thr Gly 
205 

Ser Val Asn 



<210> 35 

<211> 184 

<212> PRT 

<213> Aspergillus niger 



<400> 35 



Ser 


Ala 


Gly 


Ile


Asn 


Tyr 


Val 


Gln


Asn 


Tyr 


Asn 


Gly 


Asn 


Leu 


Gly 


Asp 


1 








5 










10 










15 




Phe 


Thr 


Tyr 


Asp 


Glu 


Ser 


Ala 


Gly 


Thr 


Phe 


Ser 


Met 


Tyr 


Trp 


Glu 


Asp 








20 










25 










30 






Gly 


Val


Ser


Ser


Asp


Phe


Val


Val


Gly 


Leu 


Gly 


Trp 


Thr 


Thr 


Gly 


Ser 






35 










40 










45 








Ser 


Asn 


Ala 


Ile


Thr 


Tyr 


Ser 


Ala 


Glu 


Tyr 


Ser 


Ala 


Ser 


Gly 


Ser 


Ala 




50 










55 










60 










Ser 


Tyr 


Leu 


Ala 


Val 


Tyr 


Gly Trp 


Val 


Asn 


Tyr 


Pro 


Gln


Ala 


Glu 


Tyr 


65 










70










75 










80 


Tyr 


Ile


Val 


Glu 


Asp 


Tyr 


Gly Asp 


Tyr Asn 


Pro 


Cys 


Ser 


Ser 


Ala 


Thr 










85 










90 










95 




Ser 


Leu 


Gly 


Thr 


Val 


Tyr 


Ser 


Asp 


Gly 


Ser 


Thr 


Tyr 


Gln


Val 


Cys 


Thr 








100 










105 










110 






Asp 


Thr 


Arg 


Thr 


Asn 


Glu 


Pro 


Ser 


Ile


Thr Gly Thr 


Ser 


Thr 


Phe 


Thr 






115 










120 










125 








Gln


Tyr 


Phe 


Ser 


Val 


Arg 


Glu 


Ser 


Thr 


Arg 


Thr 


Ser 


Gly 


Thr 


Val 


Thr 




130 










135 










140 










Val 


Ala 


Asn 


His 


Phe 


Asn 


Phe 


Trp 


Ala 


His 


His 


Gly 


Phe 


Gly 


Asn 


Ser 


145 










150 










155 










160 


Asp 


Phe 


Asn 


Tyr 


Gln


Val 


Val 


Ala 


Val 


Glu 


Ala 


Trp 


Ser 


Gly 


Ala 


Gly 










165 










170 










175 




Ser 


Ala 


Ser 


Val 


Thr 


Ile


Ser 


Ser 



















180 



<210> 36 
<211> 313 
<212> PRT 

<213> Streptomyces lividans 
<400> 36 

Ala Glu Ser Thr Leu Gly Ala Ala Ala Ala Gln Ser Gly Arg Tyr Phe

1 5 10 15

Gly Thr Ala Ile Ala Ser Gly Arg Leu Ser Asp Ser Thr Tyr Thr Ser

20 25 30

Ile Ala Gly Arg Glu Phe Asn Met Val Thr Ala Glu Asn Glu Met Lys

35 40 45

Ile Asp Ala Thr Glu Pro Gln Arg Gly Gln Phe Asn Phe Ser Ser Ala

50 55 60

Asp Arg Val Tyr Asn Trp Ala Val Gln Asn Gly Lys Gln Val Arg Gly
65 70



His 


Thr 


Leu 


Ala 


Trp 


His 


Ser 


Gln


Gln


Pro 










85 










90 


c_3 vZ3 JL 


Gly


Ser 


Ala 


Leu 




Gln


Ala 


Met 


Ile








100 










105 




Met 


Ala 


His 


Tyr 


Lys 


Gly




Ile


Val 


Gln






115 










120 






Ala 


Phe 


Ala 


Asp 


Gly 


Ser


Ser 


Gly Ala Arg 




130 










135










Ser 


Gly Asn Asp 


Tm 


Ile


Glu 


Val 


ax a 


X*± D 










T C A 
IjU 














Pro 


Ser 


Ala 


Lys




Cys 


Tyr 


Asn 










165 










170 


Trp


Thr 


Trp 


Ala 


Lys 


j. xii. 


O J- 1 1 


Ala 


Met 


Tyr 








180 










185 




T ,UQ 


V3JL11 


Arg 


Gly Val 




Tip 

X X c 


Asp 


Cys 


Val 






195 










200 






Asn 


Ser 


Gly Ser 


Pro 


Tyr 


Asn 


Ser 


Asn 


Phe 




210 










215 








Phe 


Ala 


Ala 


Leu Gly 


Val 


Asp 


Val 


Ala 


Ile


225 










230 










Gly 


Ala 


Pro 


Ala 


Ser 


Thr 


Tyr 


Ala 


Asn 


Val 










245 










250 


Val 


Ser 


Arg 


Cys 


Leu 


Gly 


Ile


Thr 


Val 


Trp 








260 










265 




Ser 


Trp 


Arg 


Ser 


Glu 


Gln


Thr 


Pro 


Leu 


Leu 






275 










280 






Lys 


Lys 


Ala 


Ala 


Tyr 


Thr 


Ala 


Val 


Leu 


Asp 




290 










295 








Ser 


Ser 


Glu 


Pro 


Pro 


Ala 


Asp 


Gly Gly 




305 










310 











<210> 37 

<211> 362 

<212> PRT 

<213> Aspergillus niger 



<400> 37 



Met 


His 


Ser 


Phe 


Ala 


Ser 


Leu 


Leu Ala Tyr 


1 








5 






10 


Thr 


Phe 


Ala 


Ser 


Ala 


Ser 


Pro 


Ile Glu Ala








20 








25 


Thr 


Thr 


Ala 


Ala 


Ala 


Ala 


Lys 


Ala Gly Lys 






35 










40 


Thr 


Leu 


Asn 


Asn 


Ile


Glu 


Val 


Pro Ala Gly 




50 










55 




Gly 


Leu 


Thr 


Ser 


Gly 


Thr 


Lys 


Val Ile Phe


65 










70 






Gin 


Tyr 


Glu 


Glu 


Trp 


Ala 


Gly 


Pro Leu Ile










85 






90 


Ile


Thr


Val


Thr 


Gly 


Ala 


Ser 


Gly His Leu 








100 








105 


Arg 


Trp 


Trp 


Asp 


Gly 


Lys 


Gly 


Thr Ser Gly 






115 










120 


Phe 


Tyr 


Ala 


His 


Gly 


Leu 


Asp 


Ser Ser Ser 




130 










135 




Lys 


Asn 


Thr 


Pro 


Leu 


Met 


Ala 


Phe Ser Val 


145 










150 






Phe 


Thr 


Asp 


Val 


Thr 


Ile


Asn 


Asn Ala Asp 










165 






170 


His 


Asn 


Thr 


Asp 


Ala 


Phe 


Asp 


Val Gly Asn 








180 








185 
75










80 


Gly Trp 


Met 


Gin 




T ,<=>i t 
















His 


lie 


Asn 




v d -L 








110 






Trp 


Asp 


Val 


Val 


IX <3 Tl 


V3 J. U 






125 








Arq 


Asp 


Ser 


Asn 




Gin 




140 










Phe 


Arg. Thr Ala 




Ala 


155 










160 




Tyr 


Asn 


Val 




T\ q y\ 










lie 

JL 1 -J 




Asn 


Met 


Val 


Arg 




irne 








190 






Gly 


Phe 


Gin 


Ser 


His 


Phe 






205 








Arg 


Thr 


Thr 


Leu 


Gin 


Asn 




220 










Thr 


Glu 


Leu 


Asp 


He 


Gin 


235 










240 


Thr 


Asn 


Asp 


Cys 


Leu 


Ala 










255 




Gly 


Val 


Arg 


Asp 


Ser 


Asp 








270 






Phe 


Asn 


Asn Asp 


Gly 


Ser 






285 








Ala 


Leu 


Asn 


Gly 


Gly 


Ala 



300 



Gly Leu 


Val 


Ala 


Gly 


Ala 










15 




Arg 


Asp 


Ser 


Cys 


Thr 


Phe 








30 






Ala 


Lys 


Cys 


Ser 


Thr 


He 






45 








Thr 


Thr 


Leu 


Asp 


Leu 


Thr 




60 










Glu 


Gly 


Thr 


Thr 


Thr 


Phe 


75 










80 


Ser 


Met 


Ser 


Gly 


Glu 


His 










95 




He 


Asn 


Cys 


Asp 


Gly 


Ala 








110 






Lys 


Lys 


Lys 


Pro 


Lys 


Phe 






125 








He 


Thr 


Gly 


Leu 


Asn 


He 




140 










Gin 


Ala 


Asn 


Asp 


He 


Thr 


155 










160 


Gly Asp 


Thr 


Gin 


Gly 


Gly 










175 




Ser 


Val 


Gly 


Val 


Asn 


He 



190
Ile


Lys 


Pro 


Trp 


Val 


His Asn Gin Asp Asp Cys Leu Ala 


Val 


Asn 


Ser 






195 










200 








205 








Glv 


Glu 


Asn 


He 


Trp 


Phe 


Thr Gly Gly Thr 


Cys 


He Gly Gly His 


Glv 




210 










215 










A A \J 








lieu 


Ser 


He 


Gly 


Ser 


Val 


Gly Asp Arg 


Ser 


Asn 


Asn Val 


Val 


Lys 


Asn 

■rats J. -i 


225 










230 










235 








240 


Val 


Thr 


He 


Glu 


His 
245 


Ser 


Thr 


Val 


Ser 


Asn 
250 


Ser 


Glu Asn 


Ala 


Val 
255 


Arg 


Tie 

-L -L C 




Thr 


He 


Ser Gly Ala Thr Gly 


Ser 


Val 


Ser Glu 


He 


Thr 










260 










265 








270 






Ser 


Asn 


He 
275 


Val 


Met 


Ser 


Gly 


He 
280 


Ser 


Asp 


Tyr 


Gly Val 
285 


Val 


He 


Gin 


Gin 


Asp 


Tyr 


Glu Asp Gly Lys 


Pro 


Thr 


Gly 


Lys 


Pro Thr 


Asn 


Gly 


Val 




290 










295 










300 








Thr 


He 


Gin 


Asp 


Val 


Lys 


Leu 


Glu 


Ser 


Val 


Thr 


Gly Ser 


Val 


Asp 


Ser 


305 










310 










315 








320 


Gly 


Ala 


Thr 


Glu 


He 
325 


Tyr 


Leu 


Leu 


Cys 


Gly 
330 


Ser 


Gly Ser 


Cys 


Ser 
335 


Asp 


Trp 


Thr 


Trp 


Asp 
340 


Asp 


Val 


Lys 


Val 


Thr 
345 


Gly 


Gly 


Lys Lys 


Ser 
350 


Thr 


Ala 


Cys 


Lys 


Asn 
355 


Phe 


Pro 


Ser 


Val 


Ala 
360 


Ser 


Cys 













<210> 38 

<211> 383 

<212> PRT 

<213> Pseudomonas cellulosa 



<400> 38 



Arg 


Ala 


Asp 


Val 


Lys 


Pro 


Val 


Thr 


Val 


Lys 


Leu 


Val 


Asp 


Ser 


Gin 


Ala 


1 








5 










10 










15 




Thr 


Met 


Glu 


Thr 


Arg 


Ser 


Leu 


Phe 


Ala 


Phe 


Met 


Gin 


Glu 


Gin 


Arg 


Arg 








20 










25 










30 






His 


Ser 


He 


Met 


Phe 


Gly 


His 


Gin 


His 


Glu 


Thr 


Thr 


Gin 


Gly 


Leu 


Thr 






35 










40 










45 








lie 


Thr 


Arg 


Thr 


Asp 


Gly 


Thr 


Gin 


Ser 


Asp 


Thr 


Phe 


Asn 


Ala 


Val 


Gly 




50 










55 










60 










Asp 


Phe 


Ala 


Ala 


Val 


Tyr 


Gly 


Trp Asp 


Thr 


Leu 


Ser 


He 


Val 


Ala 


Pro 


65 










70 










75 










80 


Lys 


Ala 


Glu 


Gly 


Asp 


He 


Val 


Ala 


Gin 


Val 


Lys 


Lys 


Ala 


Tyr 


Ala 


Arg 










85 










90 










95 




Gly 


Gly 


He 


He 


Thr 


Val 


Ser 


Ser 


His 


Phe 


Asp 


Asn 


Pro 


Lys 


Thr 


Asp 








100 










105 










110 






Thr 


Gin 


Lys 


Gly 


Val 


Trp 


Pro 


Val 


Gly 


Thr 


Ser 


Trp 


Asp 


Gin 


Thr 


Pro 






115 










120 










125 








Ala 


Val 


Val 


Asp 


Ser 


Leu 


Pro 


Gly Gly Ala 


Tyr 


Asn 


Pro 


Val 


Leu 


Asn 




130 










135 










140 










Gly 


Tyr 


Leu 


Asp 


Gin 


Val 


Ala 


Glu 


Trp 


Ala 


Asn 


Asn 


Leu 


Lys 


Asp 


Glu 


145 










150 










155 










160 


Gin 


Gly 


Arg 


Leu 


He 


Pro 


Val 


He 


Phe 


Arg 


Leu 


Tyr 


His 


Ala 


Asn 


Thr 










165 










170 










175 




Gly 


Ser 


Trp 


Phe 


Trp 


Trp 


Gly 


Asp 


Lys 


Gin 


Ser 


Thr 


Pro 


Glu 


Gin 


Tyr 








180 










185 










190 






Lys 


Gin 


Leu 


Phe 


Arg 


Tyr 


Ser 


val 


Glu 


Tyr 


Leu 


Arg 


Asp 


Val 


Lys 


Gly 






195 










200 










205 








Val 


Arg 


Asn 


Phe 


Leu 


Tyr 


Ala 


Tyr 


Ser 


Pro 


Asn 


Asn 


Phe 


Trp 


Asp 


Val 




210 










215 










220 










Thr 


Glu 


Ala 


Asn 


Tyr 


Leu 


Glu 


Arg 


Tyr 


Pro 


Gly 


Asp 


Glu 


Trp 


Val 


Asp 


225 










230 










235 










240 


Val 


Leu 


Gly 


Phe 


ASp 


Thr 


Tyr 


Gly 


Pro 


Val 


Ala 


Asp 


Asn 


Ala 


Asp 


Trp 










245 










250 










255 




Phe 


Arg 


Asn 


Val 


Val 


Ala 


Asn 


Ala 


Ala 


Leu 


Val 


Ala 


Arg 


Met 


Ala 


Glu
260 265 270





+ a. y 


Gly Lys 


Tie 
■i* j. \_* 




Val 

V CL -1* 


He 


Ser 


al ii 


lie 


uj. y 


He Arg Ala 


t*ro 






275 










280 


















Asp 


lie 


Glu 


Ala 


ai v 




TH/r 

xy t. 


Asp Asn 


ai t~i 

V3 J. 11 


irp 


ryr 


ax y 


Ti\ra 

uy a 


i-icU 






290 










-7 «J 










inn 










Ser 


Gly 


Leu 


Lys 


Ai a 
nit* 






Asp Ala 


«rg 


vjlU 


Tip 

lie 


Ala 


Phe 


Leu 


lieu 


305 










310 










315 










320 


Val 


Trp 


Arg 


Asn 


Ala 


Pro 


Gin 


Gly Val 


Pro 


Gly 


Pro 


Acfi 
nbii 


Gl v 
%j j. y 


J. III- 


Gin 










325 










J J u 










335 

—J 




Val 


Pro 


His 


Tyr 


Trp 


Val 


Pro 


Ala 


Asn 




Prn 


Glu 




Tip 

A J- C 


Ann 
noil 










340 










345 
















Gly 


Thr 


Leu 


Glu 


Asp 


Phe 


Gin 


Ala 


Phe 


Tyr 


Ala 


Asp 


u±u 


trne 


inr 


±\± a 






355 










360 










ICC 

J o _> 








Phe 


Asn 


Arg 


Asp 


He 


Glu 


Gin 


Val 


Tyr 


Gin 


Arg 


Pro 


T 1 Vi -v* 


Leu 


I j.e 






370 










375 










380 










<210> 39 




























<211> 419 




























<212> PRT 




























<213> Bacillus circulans




















<400> 39 




























Lretl 


Gin 


Pro 


Ala 


Thr 


Ala 


Glu 


Ala 


Ala 


Asp 


Ser 


Tyr 


Lys 


He 


Val 


Gly 


1 








5 










10 










15 




Tyr 


Tyr 


7™""> y^-v 

.pro 


Ser 


Trp 


Ala 


Ala 


Tyr Gly Arg Asn Tyr 


Asn 


Val 


Ala 


Asp 








20 










25 










30 






lie 


Asp 


Pro 


Thr 


Lys 


Val 


Thr 


His 


He 


Asn 


Tyr 


Ala 


Phe 


Ala 


Asp 


lie 






35 










40 










45 








Cys 


Trp 


Asn 


Gly 


He 


His 


Gly 


Asn 


Pro 


Asp 


Pro 


Ser 


Gly 


Pro 


Asn 


Pro 




50 










55 










60 










Val 


Thr 


Trp 


Thr 


Cys 


Gin 


Asn 


Glu 


Lys 


Ser 


Gin 


Thr 


He 


Asn 


Val 


Pro 


65 










70 










75 










80 


Asn 


Gly 


Thr 


He 


Val 


Leu 


Gly 


Asp 


Pro 


Trp 


He 


Asp 


Thr Gly 


Lys 


Thr 










85 










90 










95 




Phe 


Ala 


Gly 


Asp 


Thr 


Trp 


Asp 


Gin 


Pro 


He 


Ala 


Gly 


Asn 


He 


Asn 


Gin 








100 










105 










110 






Leu 


Asn 


Lys 


Leu 


Lys 


Gin 


Thr 


Asn 


Pro 


Asn 


Leu 


Lys 


Thr 


He 


He 


Ser 






115 










120 










125 








Val 


Gly 


Gly 


Trp 


Thr 


Trp 


Ser 


Asn 


Arg 


Phe 


Ser 


Asp 


Val 


Ala 


Ala 


Thr 




130 










135 










140 










Ala 


Ala 


Thr 


Arg 


Glu 


Val 


Phe 


Ala 


Asn 


Ser 


Ala 


Val 


Asp 


Phe 


Leu 


Arg 


145 










150 










155 










160 


Lys 


Tyr 


Asn 


Phe 


Asp 


Gly 


Val 


Asp 


Leu 


Asp 


Trp 


Glu 


Tyr 


Pro 


Val 


Ser 








■ 


165 










170 










175 




Gly 


Gly 


Leu 


Asp 


Gly 


Asn 


Ser 


Lys 


Arg 


Pro 


Glu 


Asp 


Lys 


Gin 


Asn 


Tyr 








180 










185 










190 






Thr 


Leu 


Leu 


Leu 


Ser 


Lys 


He 


Arg 


Glu 


Lys 


Leu 


Asp 


Ala 


Ala 


Gly Ala 






195 










200 










205 








Val 


Asp 


Gly 


Lys 


Lys 


Tyr 


Leu 


Leu 


Thr 


He 


Ala 


Ser 


Gly Ala 


Ser 


Ala 




210 










215 










220 










Thr 


Tyr 


Ala 


Ala 


Asn 


Thr 


Glu 


Leu 


Ala 


Lys 


He 


Ala 


Ala 


He 


Val 


Asp 


225 










230 










235 










240 


Trp 


lie 


Asn 


He 


Met 


Thr 


Tyr 


Asp 


Phe Asn Gly Ala 


Trp 


Gin 


Lys 


He 










245 










250 










255 




Ser 


Ala 


His 


Asn 


Ala 


Pro 


Leu 


Asn 


Tyr 


Asp 


Pro 


Ala 


Ala 


Ser 


Ala 


Ala 








260 










265 










270 






Gly 


Val 


Pro 


Asp 


Ala 


Asn 


Thr 


Phe 


Asn 


Val 


Ala 


Ala 


Gly Ala 


Gin 


Gly 






275 










280 










285 








His 


Leu 


Asp 


Ala 


Gly 


Val 


Pro 


Ala 


Ala 


Lys 


Leu 


Val 


Leu 


Gly 


Val 


Pro 




290 










295 










300 










Phe 


Tyr 


Gly 


Arg 


Gly 


Trp 


Asp 


Gly 


Cys 


Ala 


Gin 


Ala 


Gly Asn 


Gly 


Gin 


305 










310 










315 










320
Tyr


Gin 


Thr 


Cys 


Thr Gly Gly 


Ser 


Ser Val Gly Thr Trp Glu 


Ala 


Gly 










325 




330 


335 




Ser 


Phe 


Asp 


Phe 


Tyr Asp Leu 


Glu 


Ala Asn Tyr lie Asn Lys 


Asn 


Gly 








340 






345 350 






Tyr 


Thr 


Arg 


Tyr 


Trp Asn Asp 


Thr 


Ala Lys Val Pro Tyr Leu 


Tyr 


Asn 






355 






360 


365 






Ala 


Ser 


Asn 


Lys 


Arg Phe lie 


Ser 


Tyr Aep Asp Ala Glu Ser 


Val 


Gly 




370 






375 




380 






Tyr 


Lys 


Thr 


Ala 


Tyr lie Lys 


Ser 


Lys Gly Leu Gly Gly Ala 


Met 


Phe 


385 








390 




395 




400 


Trp 


Glu 


Leu 


Ser 


Gly Asp Arg 


Asn 


Lys Thr Leu Gin Asn Lys 


Leu 


Lys 










405 




410 


415 




Ala 


Asp 


Leu 















<210> 40 

<211> 317 

<212> PRT 

<213> Candida antarctica 



<400> 40 

Leu Pro Ser Gly Ser Asp Pro Ala Phe Ser Gln Pro Lys Ser Val Leu

1 5 10 15

Asp Ala Gly Leu Thr Cys Gln Gly Ala Ser Pro Ser Ser Val Ser Lys

20 25 30

Pro Ile Leu Leu Val Pro Gly Thr Gly Thr Thr Gly Pro Gln Ser Phe

35 40 45

Asp Ser Asn Trp Ile Pro Leu Ser Thr Gln Leu Gly Tyr Thr Pro Cys

50 55 60

Trp Ile Ser Pro Pro Pro Phe Met Leu Asn Asp Thr Gln Val Asn Thr

65 70 75 80

Glu Tyr Met Val Asn Ala Ile Thr Ala Leu Tyr Ala Gly Ser Gly Asn

85 90 95

Asn Lys Leu Pro Val Leu Thr Trp Ser Gln Gly Gly Leu Val Ala Gln

100 105 110

Trp Gly Leu Thr Phe Phe Pro Ser Ile Arg Ser Lys Val Asp Arg Leu

115 120 125

Met Ala Phe Ala Pro Asp Tyr Lys Gly Thr Val Leu Ala Gly Pro Leu

130 135 140

Asp Ala Leu Ala Val Ser Ala Pro Ser Val Trp Gln Gln Thr Thr Gly

145 150 155 160

Ser Ala Leu Thr Thr Ala Leu Arg Asn Ala Gly Gly Leu Thr Gln Ile

165 170 175

Val Pro Thr Thr Asn Leu Tyr Ser Ala Thr Asp Glu Ile Val Gln Pro

180 185 190

Gln Val Ser Asn Ser Pro Leu Asp Ser Ser Tyr Leu Phe Asn Gly Lys

195 200 205

Asn Val Gln Ala Gln Ala Val Cys Gly Pro Leu Phe Val Ile Asp His

210 215 220

Ala Gly Ser Leu Thr Ser Gln Phe Ser Tyr Val Val Gly Arg Ser Ala

225 230 235 240

Leu Arg Ser Thr Thr Gly Gln Ala Arg Ser Ala Asp Tyr Gly Ile Thr

245 250 255

Asp Cys Asn Pro Leu Pro Ala Asn Asp Leu Thr Pro Glu Gln Lys Val

260 265 270

Ala Ala Ala Ala Leu Leu Ala Pro Ala Ala Ala Ala Ile Val Ala Gly

275 280 285

Pro Lys Gln Asn Cys Glu Pro Asp Leu Met Pro Tyr Ala Arg Pro Phe

290 295 300

Ala Val Gly Lys Arg Thr Cys Ser Gly Ile Val Thr Pro

305 310 315



<210> 41

<211> 434

<212> PRT

<213> artificial sequence
<220>

<223> chimera of guinea pig and homo sapiens (human = approx. last 30 amino acids)



<400> 41 

Ala Glu Val Cys 

1 

Trp Ala Gly Thr 

20 

Lys Lys lie Asn 
35 

Ser Tyr Gin Lieu 
50 

Asn Phe Asn Leu 
65 

Asp Ser Gly Glu 

Gin Val Glu Lys 

100 

Lys Ala Gin Tyr 
115 

Glu Val Ala Tyr 
130 

Pro Glu Asn Val 
145 

Gly Glu Ala Gly 

Leu Asp Pro Ala 

180 

Leu Asp Pro Ser 
195 

Ser Pro lie Leu 
210 

His Met Asp Phe 
225 

Thr Gly He Ser 

Ser He Leu Asn 

260 

Asp Glu Phe Gin 
275 

Pro Lys Met Gly 
290 

Val Glu Gin Thr 
305 

Arg Trp Arg Tyr 

Ser Gly Asn He 

340 

Gin Tyr Gin Val 
355 

Asn Ser lie Asp 
370 

Phe Leu Trp Lys 
385 

Ala Ser Arg lie 
Phe Cys Ser Ser 



Tyr Ser His Leu 
5 

Ser Gin Arg Pro 

Thr Arg Phe Leu 

40 

He Thr Ala Thr 

55 

Asn Arg Lys Thr 
70 

Asn Ser Trp Leu 
85 

Val Asn Cys lie 

Ser Gin Ala Ser 

120 

Leu Val Gin Val 
135 

His He He Gly 
150 

Lys Arg Leu Asn 
165 

Glu Pro Tyr Phe 

Asp Ala Lys Phe 

200 

Pro Ser Leu Gly 
215 

Phe Pro Asn Gly 
230 

Cys Asn His His 
245 

Pro Glu Gly Phe 

Glu Ser Gly Cys 

280 

His Phe Ala Asp 
295 

Phe Phe Leu Asn 
310 

Lys Val Thr Val 
325 

Asn Val Ala Leu 

Phe Lys Gly Thr 

360 

Val Glu Leu Asn 
375 

Arg Ser Gly He 
390 

Thr Val Gin Ser 
405 

Asp lie Val Gin 



Gly 


Cys 


Phe 


Ser 




10 






He 


Lys 


Ser 


Leu 


o c 
2b 








Leu 


Tyr 


Thr 


Asn 


Asp 


He 


Ala 


Thr 








60 


Arg 


Phe 


He 


lie 






7 5 




Ser 


Asp 


Met 


Cys 




90 






Cys 


Val 


Asp 


Trp 


105 








Gin 


Asn 


He 


Arg 


Leu 


Ser 


Thr 


Ser 








140 


HXS 


Ser 


Leu 


Gly 






1 r r 

15 5 




Gly 


Leu 


Val 


Gly 




170 






Gin 


Asp 


Thr 


Pro 


185 








Val 


Asp 


Val 


He 


Phe 


Gly 


Met 


Ser 










Gly 


Lys 


Asp 


Met 






235 




Arg 


Ser 


He 


Glu 




250 






Leu 


Gly 


Tyr 


Pro 


265 








Phe 


Pro 


Cys 


Pro 


Gin 


Tyr 


Pro 


Gly 








300 


Thr 


Gly 


Ala 


Ser 






315 




Thr 


Leu 


Ser 


Gly 




330 






Leu 


Gly 


Lys 


Asn 


345 








Leu 


Lys 


Pro 


Asp 


Val 


Gly 


Thr 


He 








380 


Ser 


Val 


Ser 


Lys 






395 




Gly 


Lys 


Asp 


Gly 




410 






Glu 


Asn 


Val 


Glu 



Asp 


Glu 


Lys 


Pro 






15 




Pro 


Ser 


— - . 

Asp 


Pro 




30 






Glu 


Asn 


Gin 


Asn 


A C 

4 5 








He 


Lys 


Ala 


Ser 


His 


Gly 


Phe 


Thr 








80 


Lys 


Asn 


Met 


Phe 






95 




Lys 


Gly 


Gly 


Ser 




110 






Val 


Val 


Gly 


Ala 


125 








Leu 


Asn 


Tyr 


Ala 


Ala 


His 


Thr 


Ala 








160 


Arg 


He 


Thr 


Gly 






175 




Glu 


Glu 


Val 


Arg 




190 






His 


Thr 


Asp 


He 


one 








Gin 


Lys 


Val 


Gly 


Pro 


Gly 


Cys 


Lys 








240 


Tyr 


Tyr 


His 


Ser 






255 




Cys 


Ala 


Ser 


Tyr 




270 






Ala 


Lys 


Gly 


Cys 


285 








Lys 


Thr 


Asn 


Ala 


Asp 


Asn 


Phe 


Thr 








320 


Glu 


Lys 


Asp 


Pro 






335 




Gly 


Asn 


Ser 


Ala 




350 






Ala 


Ser 


Tyr 


Thr 


365 








Gin 


Lys 


Val 


Thr 


Pro 


Lys 


Met 


Gly 








400 


Thr 


Lys 


Tyr 


Asn 






415 




Gin 


Thr 


Leu 


Ser
420

Pro Cys



<210> 42

<211> 471

<212> PRT 

<213> Escherichia coli 



<400> 42 



Met 


Lys 


Gin 


Ser 


Thr 


He 


1 








5 




Pro 


Val 


Thr 


Lys 


-m. ^ 

Ala 


Arg 








20 






Ala 


Ala 


Gin 


Gly 


Asp 


He 






35 








Gly 


H. M 

Asp 


Gin 


Thr 


-i _ 
Ala 


Ala 




50 










Lys 


Asn 


He 


He 


Leu 


Leu 


65 










70 


Thr 


Ala 


Ala 


Arg 


Asn 


Tyr 










85 




lie 


Asp 


Ala 


Leu 


Pro 


Leu 








100 






Lys 


Lys 


Thr 


Gly 


Lys 


Pro 






115 








Thr 


Ala 


Trp 


Ser 


Thr 


Gly 




130 










Asp 


He 


His 


Glu 


Lys 


Asp 


145 










150 


Ala 


Gly 


Leu 


Ala 


Thr 


Gly 










165 




Thr 


Pro 


Ala 


Ala 


Leu 


Val 








180 






Pro 


Ser 


Ala 


Thr 


Ser 


Glu 






195 








vjiy 


Lys 


Gly 


Ser 


lie 


Thr 




210 










Thr 


Leu 


Gly 


Gly 


Gly 


Ala 


225 










230 


Glu 


Trp 


Gin 


Gly 


Lys 


Thr 










245 




Gin 


Leu 


Val 


Ser 


Asp 


Ala 








260 






Gin 


Gln
Leu










125 








Val 


Thr 


Gly 


Asp 


Gin 


Gly 


He 


Gly 








140 










Thr 


Ala 


Lys 


Ala 


Met 


Ala 


Phe 


Tyr 






155 










160 


Ser 


Asp 


He 


He 


Asp 


He 


Gin 


Met 




170 










175 




His 


Phe 


Leu 


His 


Cys 


Asn 


Gly 


Arg 


185 










190 






Phe 


Pro 


lie 


Pro 


Lys 


Arg 


He 


His 










205 








lie 


Asp 


Asp 


Val 


Gly 


Tyr 


Ala 


Phe 








220 










He 


Thr 


Ser 


Leu 


Leu Gly Arg 


His 






235 










240 


Tyr 


Ala 


Asp 


Thr 


Pro 


Ser 


Pro 


Met 




250 










255 




Pro 


Arg 


Thr 


Val 


Asp 


Ser 


Ser 


Trp 


265 










270 






Ala 


Met 


Trp 


Gly His 


Lys 


Ser 


Val 



285

<210> 47

<211> 311

<212> PRT

<213> Acinetobacter sp.



<400> 47 

Met Glu Val Lys 

1 

Val Ala Ser Gly 

20 

lie He His Arg 

35 

Asn He Thr Ser 
50 

Leu Gly Ala Asn 
65 

Asp His Tyr Leu 

He Glu Asn Ala 

100 

Gly Ala Pro Glu 
115 

Pro Asn Gly His 
130 

Gly Lys Pro Leu 
145 

Lys Gly Phe Tyr 

Asn Met Arg Arg 

180 

Arg Thr lie Leu 
195 

Gin Gin Leu Leu 
210 

He His Tyr Phe 
225 

He Asn Val Ala 

Thr Arg Glu Gly 

260 

Ala He Lys Ala 
275 

Asp Leu Lys Leu 
290 

Asp Arg Pro Arg 
305 



He Phe Asn Thr 
5 

Leu Glu Gin Glu 

Val Leu Ser Asp 

40 

Asp Glu Tyr Trp 
55 

Gin Glu Ala Gly 
70 

Asp Met Arg Met 
85 

Thr Pro Arg Thr 

Ser Val Gly Tyr 

120 

Thr Leu He Leu 
135 

Pro Asn Ala Lys 
150 

Ser His Phe Asp 
165 

Ser He He Thr 

Pro Ala Gly Tyr 

200 

Asn Gin Leu Gly 
215 

Val Ser Ala Asp 
230 

Gly Asp Pro Tyr 
245 

Leu Val Val Asp 

Asn Asp Val Glu 

280 

Thr Arg Leu Val 
295 

Leu Ala val 
310 



Gin Asp Val Gin 
10 

Gly Gly Asn Pro 
25 

Leu Tyr Lys Ala 

Ala Gly Val Ala 

60 

Leu Leu Ser Pro 
75 

Asp Ala Glu Asp 
90 

lie Glu Gly Pro 
105 

Ala Arg Met Asp 

His Gly Thr He 

140 

Val Glu He Trp 
155 

Pro Thr Gly Glu 
170 

Asp Glu Asn Gly 
185 

Gly Cys Pro Pro 

Arg His Gly Asn 

220 

Gly His Arg Lys 
235 

Thr Tyr Asp Asp 
250 

Ala Val Glu His 
265 

Gly Pro Phe Ala 

Asp Gly Val Asp 

300 



ASp 


irne 


Lieu 


Arg 






15 




Arg 


vai 


Lys 


Gin 




o U 






lie 


Glu 


Asp 


Leu 










Tyr 


Leu 


Asn 


Gin 


Gly 


Leu 


Gly 


Phe 








80 


Ala 


Ala 


Leu 


Gly 






95 




Leu 


Tyr 


Val 


Ala 




110 






Asp 


Gly 


Ser 


Asp 


125 








Phe 


Asp 


Ala 


Asp 


His 


Ala 


Asn 


Thr 








160 


Gin 


Gin 


Ala 


Phe 






175 




Gin 


Tyr 


Arg 


Val 




190 






Glu 


Gly 


Pro 


Thr 


205 








Arg 


Pro 


Ala 


His 


Leu 


Thr 


Thr 


Gin 








240 


Phe 


Ala 


Tyr 


Ala 






255 




Thr 


Asp 


Pro 


Glu 




270 






Glu 


Met 


Val 


Phe 


285 








Asn 


Gin 


Val 


Val 



<210> 48 
<211> 414 
<212> PRT 

<213> Pseudomonas putida 
<400> 48 

Thr Thr Glu Thr Ile Gln Ser Asn Ala Asn Leu Ala Pro Leu Pro Pro

1 5 10 15

His Val Pro Glu His Leu Val Phe Asp Phe Asp Met Tyr Asn Pro Ser

20 25 30

Asn Leu Ser Ala Gly Val Gln Glu Ala Trp Ala Val Leu Gln Glu Ser

35 40 45

Asn Val Pro Asp Leu Val Trp Thr Arg Cys Asn Gly Gly His Trp Ile

50 55 60

Ala Thr Arg Gly Gln Leu Ile Arg Glu Ala Tyr Glu Asp Tyr Arg His

65

Phe Ser Ser Glu 

Asp Phe lie Pro 

100 

Ala Leu Ala Asn 
115 

Asn Arg lie Gin 
130 

Gin Gly Gin Cys 
145 

Arg lie Phe Met 

Leu Lys Tyr Leu 

180 

Phe Ala Glu Ala 
195 

Glu Gin Arg Arg 
210 

Asn Gly Gin Val 
225 

Met Cys Gly Leu 

Leu Ser Phe Ser 

260 

Glu Leu lie Gin 
275 

Leu Arg Arg Phe 
290 

Tyr Glu Phe His 
305 

Pro Gin Met Leu 

His Val Asp Phe 

340 

Gly Ser His Leu 
355 

Val Thr Leu Lys 
370 

Pro Gly Ala Gin 
385 

Ala Leu Pro Leu 





70 






Cys 


Pro 


Phe 


He 


85 








or 


O f~* v 




ASp 




val 


vax 










ion 
j. a c 


Vj JL LI 




Ala 


V— y s> 






13 S 

-L -J —J 




Asm 




x im 


UJ.U 




150 






Leu 


Leu 


Ala 


Gly 


XD3 








inr 


Asp 




Met 


Lys 


GlU 


Ala 


Leu 








T A n 


uin 


Lys 


jfro 


Gly 






215 




Asn 


Gly 


Arg 


Pro 




230 






Leu 


Leu 


Val 


Gly 










Met 


Glu 


Phe 


Leu 


Arg 


Pro 


Glu 


Arg 








280 


Ser 


Leu 


Val 


Ala 






295 




Gly 


Val 


Gin 


Leu 




310 






Ser 


Gly Leu Asp 


325 








Ser 


Arg 


Gin 


Lys 


Cys 


Leu 


Gly 


Gin 








360 


Glu 


Trp 


Leu 


Thr 






375 




lie 


Gin 


His 


Lys 




390 






Val 


Trp 


Asp 


Pro 


405 









75 

Pro Arg Glu Ala 
90 

Pro Pro Glu Gin 
105 

Met Pro Val Val 

Ser Leu He Glu 

14 0 

Asp Tyr Ala Glu 
155 

Leu Pro Glu Glu 
170 

Thr Arg Pro Asp 
18 5 

Tyr Asp Tyr Leu 

Thr Asp Ala He 

220 

He Thr Ser Asp 
235 

Gly Leu Asp Thr 
250 

Ala Lys Ser Pro 
265 

He Pro Ala Ala 

Asp Gly Arg He 

300 

Lys Lys Gly Asp 
315 

Glu Arg Glu Asn 
330 

Val Ser His Thr 
345 

His Leu Ala Arg 

Arg He Pro Asp 

380 

Ser Gly He Val 
3 95 

Ala Thr Thr Lys 
410 



80 

Gly Glu Ala Tyr 
95 

Arg Gin Phe Arg 
110 

Asp Lys Leu Glu 

125 

Ser Leu Arg Pro 

Pro Phe Pro He 

160 

Asp He Pro His 
175 

Gly Ser Met Thr 
190 

He Pro lie He 
205 

Ser He Val Ala 

Glu Ala Lys Arg 

240 

Val Val Asn Phe 
255 

Glu His Arg Gin 
270 

Cys Glu Glu Leu 
285 

Leu Thr Ser Asp 

Gin lie Leu Leu 

320 

Ala Cys Pro Met 
335 

Thr Phe Gly His 
350 

Arg Glu He He 
365 

Phe Ser He Ala 

Ser Gly Val Gin 

400 

Ala Val 



<210> 49 
























<211> 374 
























<212> PRT 
























<213> Equus caballus 




















<400> 49 
























Ser Thr Ala 


Gly Lys 


Val 


He 


Lys 


Cys Lys 


Ala 


Ala 


Val 


Leu 


Trp 


Glu 


1 


5 








10 










15 




Glu Lys Lys 


Pro Phe 


Ser 


He 


Glu 


Glu Val 


Glu 


Val 


Ala 


Pro 


Pro 


Lys 




20 








25 








30 




Ala His Glu 


Val Arg 


He 


Lys 


Met 


Val Ala Thr Gly 


He 


Cys 


Arg 


Ser 


35 








40 








45 








Asp Asp His 


Val Val 


Ser 


Gly Thr Leu Val 


Thr 


Pro 


Leu 


Pro 


val 


He 


50 






55 








60 










Ala Gly His 


Glu Ala 


Ala 


Gly 


He 


Val Glu 


Ser 


He 


Gly 


Glu 


Gly 


Val 


65 




70 








75 








80 


Thr Thr Val 


Arg Pro 


Gly 


Asp 


Lys 


Val He 


Pro 


Leu 


Phe 


Thr 


Pro 


Gin 



85 90 95
Cys


Giy 


jjys 


L.ys 


Arg 


val 


Cys 


Lys 


His 


Pro 


Glu 


Gly Asn 


Phe 


Cys 


Leu 








inn 










105 










110 






Lya 


Asn 


Asp 


Leu 


Ser 


Kit — A-_ 

Met 


Pro 


Arg Gly Thr 


Met 


Gin Asp Gly Thr 


Ser 






TIC 

lib 










120 










125 








Arg 


Pete 


Ttir 


Cys 


Arg 


Gly 


Lys 


Pro 


He 


His 


His 


Phe 


Leu Gly Thr 


Ser 




130 










135 










140 










Thr 


Phe 


Ser 


Gin 


Tyr 


ml* « 

Thr 


Val 


Val 


Asp 


Glu 


lie 


Ser 


Val 


Ala 


Lys 


He 


145 










150 










155 










160 


Asp 


Ala 


Til 

Ala 


Jber 


Pro 


Leu 


Glu 


Lys 


Val 


Cys 


Leu 


He 


Gly Cys 


Gly 


Phe 










165 










170 










175 




Ser 


Thr 


Gly 


Tyr 
180 


Gly 


Ser 


Ala 


Val 


Lys 
185 


Val 


Ala 


Lys 


Val 


Thr 
190 


Gin 


Gly 


Ser 


Thr 


Cys 


Ala 


Val 


Phe 


Gly 


Leu Gly Gly Val 


Gly 


Leu 


Ser 


Val 


He 






195 










200 










205 








Met 


Gly 


Cys 


Lys 


Ala 


Ala 


Gly 


Ala 


Ala 


Arg 


He 


He 


Gly Val 


Asp 


He 




210 










215 










220 










Asn 


Lys 


Asp 


Lys 


Phe 


Ala 


Lys 


Ala 


Lys 


Glu 


Val 


Gly Ala 


Thr 


Glu 


Cys 


225 










230 










235 










240 


Val 


Asn 


Pro 


Gin 


Asp 
245 


Tyr 


Lys 


Lys 


Pro 


He 
250 


Gin 


Glu 


Val 


Leu 


Thr 
255 


Glu 


Met 


Ser 


Asn 


Gly 
260 


Gly 


Val 


Asp 


Phe 


Ser 
265 


Phe 


Glu 


Val 


He 


Gly 
270 


Arg 


Leu 


Asp 


Thr 


Met 
275 


Val 


Thr 


Ala 


Leu 


Ser 
280 


Cys 


Cys 


Gin 


Glu 


Ala 
285 


Tyr 


Gly 


Val 


Ser 


val 


lie 


Val 


Gly Val 


Pro 


Pro 


Asp 


Ser 


Gin 


Asn 


Leu 


Ser 


Met 


Asn 




290 










295 










300 










Pro 


Met 


Leu 


Leu 


Leu 


Ser 


Gly 


Arg 


Thr 


Trp 


Lys 


Gly 


Ala 


He 


Phe 


Gly 


305 










310 










315 










320 


Gly 


Phe 


Lys 


Ser 


Lys 
325 


Asp 


Ser 


Val 


Pro 


Lys 
330 


Leu 


Val 


Ala 


Asp 


Phe 
335 


Met 


Ala 


Lys 


Lys 


Phe 
340 


Ala 


Leu 


Asp 


Pro 


Leu 
345 


He 


Thr 


His 


Val 


Leu 
350 


Pro 


Phe 


Glu 


Lys 


lie 


Asn 


Glu 


Gly Phe 


Asp 


Leu 


Leu 


Arg 


Ser 


Gly 


Glu 


Ser 


He 






355 










360 










365 








Arg 


Thr 
370 


He 


Leu 


Thr 


Phe 























<210> 50 
<211> 297 
<212> PRT 

<213> Escherichia coli 
<400> 50 

Met Ala Thr Asn Leu Arg Gly Val Met Ala Ala Leu Leu Thr Pro Phe 

1 5 10 15 

Asp Gln Gln Gln Ala Leu Asp Lys Ala Ser Leu Arg Arg Leu Val Gln

20 25 30

Phe Asn Ile Gln Gln Gly Ile Asp Gly Leu Tyr Val Gly Gly Ser Thr

35 40 45

Gly Glu Ala Phe Val Gln Ser Leu Ser Glu Arg Glu Gln Val Leu Glu

50 55 60

Ile Val Ala Glu Glu Gly Lys Gly Lys Ile Lys Leu Ile Ala His Val
65 70 75 80

Gly Cys Val Thr Thr Ala Glu Ser Gln Gln Leu Ala Ala Ser Ala Lys

85 90 95

Arg Tyr Gly Phe Asp Ala Val Ser Ala Val Thr Pro Phe Tyr Tyr Pro

100 105 110

Phe Ser Phe Glu Glu His Cys Asp His Tyr Arg Ala Ile Ile Asp Ser

115 120 125

Ala Asp Gly Leu Pro Met Val Val Tyr Asn Ile Pro Ala Leu Ser Gly

130 135 140

Val Lys Leu Thr Leu Asp Gln Ile Asn Thr Leu Val Thr Leu Pro Gly
145










150 










155 










160 


Val 


Glv 


Ala 


Leu 


Lvs 


Gin 


Thr 


Ser Gly 


Asp 


Leu 


Tyr 


Gin 


Met 


Glu 


Gin 










165 










170 










175 




lie 

**- ^ 


At" a 


Arg 


Glu 


His 


Pro Asp 


Leu 


Val 


Leu 


Tyr 


Asn 


Gly Tyr Asp Glu 








180 










185 










190 






•Lie 


xrne 


Ala 


Ser 




Leu 


Leu 


Ala 


Gly 


Ala 


Asp Gly 


Gly 


He 


Gly Ser 






195 










200 










205 








Thr 


Tyr 


Asn 


He 


Met 


Gly Trp 


Arg Tyr 


Gin 


Gly 


He 


Val 


Lys 


Ala 


Leu 




210 










215 










220 










Lys 


Glu 


Gly Asp 


He 


Gin 


Thr 


Ala 


Gin 


Lys 


Leu 


Gin 


Thr 


Glu 


Cys 


Asn 


225 










230 










235 










240 


Lys 


Val 


He 


Asp 


Leu 


Leu 


He 


Lys 


Thr 


Gly Val 


Phe 


Arg Gly Leu 


Lys 










245 










250 










255 




Thr 


Val 


Leu 


His 
260 


Tyr 


Met 


Asp 


Val 


Val 

265 


Ser 


Val 


Pro 


Leu 


Cys 
270 


Arg 


Lys 


Pro 


Phe 


Gly 
275 


Pro 


Val 


Asp 


Glu 


Lys 
280 


Tyr 


Leu 


Pro 


Glu 


Leu 
285 


Lys 


Ala 


Leu 


Ala 


Gin 
290 


Gin 


Leu 


Met 


Gin 


Glu 
295 


Arg 


Gly 

















<210> 51 

<211> 268 

<212> PRT 

<213> Salmonella typhimurium 

<400> 51 



Met 


Glu 


Arg 


Tyr 


Glu 


Asn 


Leu 


Phe 


Ala 


Gin 


Leu 


Asn 


Asp 


Arg 


Arg 


Glu 


1 








5 










10 










15 




Gly 


Ala 


Phe 


Val 


Pro 


Phe 


Val 


Thr 


Leu 


Gly Asp 


Pro 


Gly 


He 


Glu 


Gin 








20 










25 










30 






Ser 


Leu 


Lys 


He 


He 


Asp 


Thr 


Leu 


He 


Asp Ala 


Gly 


Ala 


Asp 


Ala 


Leu 






35 










40 










45 








Glu 


Leu 


Gly 


Val 


Pro 


Phe 


Ser 


Asp 


Pro 


Leu 


Ala 


Asp 


Gly 


Pro 


Thr 


He 




50 










55 










60 










Gin 


Asn 


Ala 


Asn 


Leu 


Arg 


Ala 


Phe 


Ala" 


Ala 


Gly 


Val 


Thr 


Pro 


Ala 


Gin 


65 










70 










75 










80 


Cys 


Phe 


Glu 


Met 


Leu 


Ala 


Leu 


He 


Arg 


Glu 


Lys 


His 


Pro 


Thr 


He 


Pro 










85 










90 










95 




He 


Gly 


Leu 


Leu 


Met 


Tyr 


Ala 


Asn 


Leu 


Val 


Phe 


Asn 


Asn 


Gly 


He 


Asp 








100 










105 










110 






Ala 


Phe 


Tyr 


Ala 


Arg 


Cys 


Glu 


Gin 


Val 


Gly 


Val 


Asp 


Ser 


Val 


Leu 


Val 






115 










120 










125 








Ala 


Asp 


Val 


Pro 


Val 


Glu 


Glu 


Ser 


Ala 


Pro 


Phe 


Arg 


Gin 


Ala 


Ala 


Leu 




130 










135 










140 










Arg 


His 


Asn 


He 


Ala 


Pro 


He 


Phe 


He 


Cys 


Pro 


Pro 


Asn 


Ala 


Asp 


Asp 


145 










150 










155 










160 


Asp 


Leu 


Leu 


Arg 


Gin 


Val 


Ala 


Ser 


Tyr 


Gly Arg 


Gly 


Tyr 


Thr 


Tyr 


Leu 










165 










170 










175 




Leu 


Ser 


Arg 


Ser 


Gly 


Val 


Thr 


Gly Ala 


Glu 


Asn 


Arg 


Gly Ala 


Leu 


Pro 








180 










185 










190 






Leu 


His 


His 


Leu 


He 


Glu 


Lys 


Leu 


Lys 


Glu 


Tyr 


His 


Ala 


Ala 


Pro 


Ala 






195 










200 










205 








Leu 


Gin 


Gly 


Phe 


Gly 


He 


Ser 


Ser 


Pro 


Glu 


Gin 


Val 


Ser 


Ala 


Ala 


Val 




210 










215 










220 










Arg 


Ala 


Gly 


Ala 


Ala 


Gly Ala 


He 


Ser 


Gly Ser 


Ala 


He 


Val 


Lys 


He 


225 










230 










235 










240 


He 


Glu 


Lys 


Asn 


Leu 


Ala 


Ser 


Pro 


Lys 


Gin 


Met 


Leu 


Ala 


Glu 


Leu 


Arg 










245 










250 










255 




Ser 


Phe 


Val 


Ser 


Ala 


Met 


Lys 


Ala 


Ala 


Ser 


Arg 


Ala 











260 265
<210> 52

<211> 393 

<212> PRT 

<213> Actinoplanes missouriensis 



<400> 52 



O a *• 


Val 


Gin 


Til a 


i 








Val 


Gly Trp 


Gin 








o n 

A U 


Leu 


Asp 


Pro 


va J. 






35 




Gly 


lie 


Thr 


Pne 




50 






Gin 


Thr 


Arg 


Asp 


65 








Thr 


Gly 


Leu 


He 


Val 


Phe 


Lys 


Asp 








100 


Tyr 


Ala 


lie 


Arg 






115 




Gly 


Ala 


Lys 


Thr 




130 






Asp 


Ser 


Ala 


Lys 


145 








Leu 


Asn 


Leu 


Leu 


Phe 


Ala 


lie 


Glu 








TOO 

180 


Pro 


Thr 


Ala 


Gly 






195 




Glu 


Leu 


Phe 


Gly 




210 






Leu 


Asn 


Phe 


Thr 


225 








Phe 


His 


He 


Asp 


Leu 


Val 


Phe 


Gly 








260 


Leu 


Leu 


Glu 


Asn 






275 




His 


Phe 


Asp 


Tyr 




290 






Glu 


Ser 


Ala 


Lys 


305 








Ala 


Lys 


Ala 


Phe 


Ser 


Lys 


Val 


Ala 








340 


Tyr 


Ala 


Glu 


Leu 






355 




Asp Ala Val 


Gly 




370 






Ala 


lie 


Glu 


His 


385 









Thr Arg Glu Asp 
5 

Ala Arg Asp Ala 

Glu Ala val His 

40 

His Asp Asp Asp 
55 

Gly He He Ala 
70 

Val Pro Met Val 
85 

Gly Gly Phe Thr 

Lys Val Leu Arg 

120 

Leu Val Leu Trp 
135 

Asp Val Ser Ala 
150 

Ala Gin Tyr Ser 
165 

Pro Lys Pro Asn 

His Ala He Ala 

200 

He Asn Pro Glu 
215 

Gin Gly He Ala 
230 

Leu Asn Gly Gin 
245 

His Gly Asp Leu 

Gly Pro Asp Gly 

280 

Lys Pro Ser Arg 
295 

Ala Asn He Arg 
310 

Arg Ala Asp Pro 
325 

Glu Leu Lys Thr 

Leu Ala Asp Arg 

360 

Ala Lys Gly Phe 
375 

Leu Leu Gly Ala 
390 



Lys Phe Ser Phe 
10 

Phe Gly Asp Ala 
25 

Lys Leu Ala Glu 

Leu Val Pro Phe 

60 

Gly Phe Lys Lys 
75 

Thr Thr Asn Leu 
90 

Ser Asn Asp Arg 
105 

Gin Met Asp Leu 

Gly Gly Arg Glu 

140 

Ala Leu Asp Arg 
155 

Glu Asp Arg Gly 
170 

Glu Pro Arg Gly 
185 

Phe Val Gin Glu 

Thr Gly Asn Glu 

220 

Gin Ala Leu Trp 
235 

His Gly Pro Lys 
250 

Leu Asn Ala Phe 
265 

Ala Pro Ala Tyr 

Thr Glu Asp Tyr 

300 

Met Tyr Leu Leu 
315 

Glu Val Gin Glu 

. 330 
Pro Thr Leu Asn 
345 

Ser Ala Phe Glu 

Gly Phe Val Lys 

380 

Arg 



Gly Leu Trp Thr 
15 

Thr Arg Thr Ala 
30 

He Gly Ala Tyr 
45 

Gly Ser Asp Ala 

Ala Leu Asp Glu 

80 

Phe Thr His Pro 
95 

Ser Val Arg Arg 
110 

Gly Ala Glu Leu 
125 

Gly Ala Glu Tyr 

Tyr Arg Glu Ala 

160 

Tyr Gly Leu Arg 
175 

Asp He Leu Leu 
190 

Leu Glu Arg Pro 
205 

Gin Met Ser Asn 

His Lys Lys Leu 

240 

Phe Asp Gin Asp 
255 

Ser Leu Val Asp 
270 

Asp Gly Pro Arg 
285 

Asp Gly Val Trp 

Leu Lys Glu Arg 

320 

Ala Leu Ala Ala 
335 

Pro Gly Glu Gly 
350 

Asp Tyr Asp Ala 
365 

Leu Asn Gin Leu 



<210> 53

<211> 348 

<212> PRT 

<213> Bacteriophage T7 
<400> 53



Val 


Asn 


He 


Lys 


Thr 


Asn 


Pro 


Phe 


Lys 


Ala 


Val 


Ser 


Phe 


Val 


Glu 


Ser 


l 








5 










10 










15 




Ala 


lie 


Lys 


Lys 


Ala 


Leu Asp 


Asn 


Ala Gly Tyr 


Leu 


He 


Ala 


Glu 


He 








A U 










25 










3 0 








iyr 






val 


Arg 


Gly Asn 


He 


Cys 


Val 




7\ c ti 


i xi r 


Aia 


TA *m w« 

Asn 






-J Z3 










40 


















Ser 


Tyr 


Trp 


Leu 


Ser Arg Val 


oer 


Lys 


Thr 


He 


Pro 


Ala 


Leu 


Glu 


His 














55 










a a 
o U 










Leu 


Asn 


Gly 


Phe 


Asp 


Val 


Arg 


Trp 


Lys 


Arg 


Leu 


Leu 


Asn 


Asp 


Asp 


Arg 


65 










70 










75 










80 


Cys 


Phe 


Tyr 


Lys 


Asp Gly 


Phe 


Met 


Leu Asp Gly 


Glu 


Leu 


Met 


Val 


Lys 










85 










90 










95 




o»iy 


vai 


Asp 




Asn 


Thr 


Gly 


Ser 


Gly Leu Leu 


Arg 


Til IT 


Lys 


Trp 


Tnr 








100 










105 










110 






Asp 


T'Vi -v- 


T T F 

iiya 
115 


TV <^ 

Asn 


Gin 


Glu 


Phe 


His 
120 


Glu 


Glu 


Leu 




val 
125 


Glu 


Pro 


lie 


TV -w^"* 

Arg 


Lys 
13 0 




Asp 


Lys 


Val 


Pro 
135 


Phe 


Lys 


Leu 


His 


Tnr 
140 


Gly 


His 


Leu 


His 


lie 


Lys 


Leu 


Tyr 


Ala 


He 


Leu 


Pro 


Leu 


His 


He 


Val 


Glu 


Ser 


Gly 


Glu 


145 










150 










155 










160 


Asp 


Cys 


Asp 


Val 


Met 
165 


Thr 


Leu 


Leu 


Met 


Gin 
170 


Glu 


His 


val 


Lys 


Asn 
175 


Met 


Leu 


Pro 


Leu 


Leu 
180 


Gin 


Glu 


Tyr 


Phe 


Pro 
185 


Glu 


He 


Glu 


Trp 


Gin 
190 


Ala 


Ala 


Glu 


Ser 


Tyr 
195 


Glu 


Val 


Tyr 


Asp 


Met 
200 


Val 


Glu 


Leu 


Gin 


Gin 
205 


Leu 


Tyr 


Glu 


Gin 


Lys 


Arg 


Ala 


Glu Gly His 


Glu 


Gly Leu 


He 


Val 


Lys 


Asp 


Pro 


Met 




210 










215 










220 










Cys 


He 


Tyr 


Lys 


Arg 


Gly 


Lys 


Lys 


Ser 


Gly Trp 


Trp 


Lys 


Met 


Lys 


Pro 


225 










230 










235 










240 


Glu 


Asn 


Glu 


Ala 


Asp 


Gly 


He 


He 


Gin Gly Leu 


Val 


Trp 


Gly 


Thr 


Lys 










245 










250 










255 




Gly 


Leu 


Ala 


Asn 


Glu Gly Lys Val 


He 


Gly 


Phe 


Glu 


Val 


Leu 


Leu 


Glu 








260 










265 










270 






Ser 


Gly 


Arg 
275 


Leu 


Val 


Asn 


Ala 


Thr 
280 


Asn 


He 


Ser 


Arg 


Ala 
285 


Leu 


Met 


Asp 


Glu 


Phe 
290 


Thr 


Glu 


Thr 


Val 


Lys 
295 


Glu 


Ala 


Thr 


Leu 


Ser 
300 


Gin 


Trp 


Gly 


Phe 


Phe 


Ser 


Pro 


Tyr 


Gly 


He 


Gly Asp 


Asn Asp 


Ala 


Cys 


Thr 


He 


Asn 


Pro 


305 










310 










315 










320 


Tyr 


Asp 


Gly 


Trp 


Ala 

325 


Cys 


Gin 


He 


Ser 


Tyr 
330 


Met 


Glu 


Glu 


Thr 


Pro 
335 


Asp 


Gly 


Ser 


Leu 


Arg 
340 


His 


Pro 


Ser 


Phe 


Val 
345 


Met 


Phe 


Arg 











<210> 54 

<211> 42 

<212> DNA 

<213> artificial sequence 
<220> 

<223> binding site for restr1 and restr2
<220> 

<221> CDS 

<222> (2)..(40)
<223> 



<400> 54 

g gtg gta tca gca ggc cac tgc tac aag tcc cgc atc cag gt                 42
  Val Val Ser Ala Gly His Cys Tyr Lys Ser Arg Ile Gln
  1               5                   10






<210> 55
<211> 13
<212> PRT
<213> artificial sequence
<220>
<223> binding site for restr1
<400> 55

Val Val Ser Ala Gly His Cys Tyr Lys Ser Arg Ile Gln
1               5                   10



<210> 56 

<211> 42 

<212> DNA 

<213> artificial sequence 
<220> 

<223> forward primer restr1

<400> 56 

ggtggtatcc gcgggccact gctacaagtc ccggatccag gt 42 



<210> 57
<211> 42
<212> DNA
<213> artificial sequence
<220>
<223> reverse primer restr2
<400> 57

acctggatcc gggacttgta gcagtggccc gcggatacca cc 42



<210> 58 

<211> 50 

<212> DNA 

<213> artificial sequence 
<220> 

<223> binding site for restr3 and restr4 
<220> 

<221> CDS 

<222> (3)..(50)
<223> 

<400> 58 

cc act ggc acg aag tgc ctc atc tct ggc tgg ggc aac act gcg agc           47
   Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly Asn Thr Ala Ser
   1               5                   10                  15

tct                                                                      50
Ser



<210> 59 

<211> 16

<212> PRT 

<213> artificial sequence 
<220> 

<223> binding site for restr3 and restr4 



<400> 59 






Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly Asn Thr Ala Ser Ser
1               5                   10                  15



<210> 60 

<211> 50 

<212> DNA 

<213> artificial sequence 
<220> 

<223> forward primer restr3 

<400> 60 

ccactggcac gaagtgcctc atctctggct ggggcaacac tgcgagctct 50 



<210> 61
<211> 50
<212> DNA
<213> artificial sequence
<220>
<223> reverse primer restr4
<400> 61

agagctagca gtgttgcccc agccagagat gaggcacttg gtaccagtgg 50



<210> 62 
<211> 30 
<212> DNA 

<213> artificial sequence
<220>

<223> primer puc-forward
<400> 62

ggggtacccc accaccatga atccactcct 30



<210> 63
<211> 30
<212> DNA
<213> artificial sequence
<220>
<223> primer puc-reverse
<400> 63

cgggatccgg tatagagact gaagagatac 30



<210> 64 

<211> 39 

<212> DNA 

<213> artificial sequence 
<220> 

<223> oligox-SDR1f
<220>
<221> misc_feature
<222> (14)..(31)
<223> any nucleotide
<220>
<221> misc_feature
<222> (14)..(31)
<223> any nucleotide or amino acid residue
<220>
<221> CDS
<222> (2)..(37)
<223>

<400> 64

g ggc cac tgc tac nnn nnn nnn nnn nnn nnn aag tcc cg                     39
  Gly His Cys Tyr Xaa Xaa Xaa Xaa Xaa Xaa Lys Ser
  1               5                   10



<210> 65
<211> 12
<212> PRT
<213> artificial sequence
<220>
<221> misc_feature
<222> (5)..(5)
<223> The 'Xaa' at location 5 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.

<220>
<221> misc_feature
<222> (6)..(6)
<223> The 'Xaa' at location 6 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.

<220>
<221> misc_feature
<222> (7)..(7)
<223> The 'Xaa' at location 7 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.

<220>
<221> misc_feature
<222> (8)..(8)
<223> The 'Xaa' at location 8 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.

<220>
<221> misc_feature
<222> (9)..(9)
<223> The 'Xaa' at location 9 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.

<220>
<221> misc_feature
<222> (10)..(10)
<223> The 'Xaa' at location 10 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.

<220>
<223> oligox-SDR1f
<220>
<221> misc_feature
<222> (14)..(31)
<223> any nucleotide
<220>
<221> misc_feature
<222> (14)..(31)
<223> any nucleotide or amino acid residue
<400> 65

Gly His Cys Tyr Xaa Xaa Xaa Xaa Xaa Xaa Lys Ser
1               5                   10



<210> 66
<211> 45
<212> DNA
<213> artificial sequence
<220>
<223> oligox-SDR1r
<220>
<221> misc_feature
<222> (16)..(33)
<223> any nucleotide
<400> 66

cgcccggtga cgatgnnnnn nnnnnnnnnn nnnttcaggg cctag 45



<210> 67

<211> 47 

<212> DNA 

<213> artificial sequence 
<220> 

<223> oligox-SDR2f 
<220> 

<221> CDS 

<222> (2)..(46)
<223>
<220>

<221> misc_feature

<222> (29)..(43)

<223> any nucleotide or amino acid residue 



<400> 67 

c aag tgc ctc atc tct ggc tgg ggc aac nnn nnn nnn nnn nnn act g          47
  Lys Cys Leu Ile Ser Gly Trp Gly Asn Xaa Xaa Xaa Xaa Xaa Thr
  1               5                   10                  15



<210> 68 
<211> 15 
<212> PRT 

<213> artificial sequence 
<220>
<221> misc_feature
<222> (10)..(10)
<223> The 'Xaa' at location 10 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.

<220>
<221> misc_feature
<222> (11)..(11)
<223> The 'Xaa' at location 11 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.

<220>
<221> misc_feature
<222> (12)..(12)
<223> The 'Xaa' at location 12 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.

<220>
<221> misc_feature
<222> (13)..(13)
<223> The 'Xaa' at location 13 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.

<220>
<221> misc_feature
<222> (14)..(14)
<223> The 'Xaa' at location 14 stands for Lys, Asn, Arg, Ser, Thr, Ile,
Met, Glu, Asp, Gly, Ala, Val, Gln, His, Pro, Leu, a stop codon,
Tyr, Trp, Cys, or Phe.

<220>
<223> oligox-SDR2f
<220>
<221> misc_feature
<222> (29)..(43)
<223> any nucleotide or amino acid residue
<400> 68

Lys Cys Leu Ile Ser Gly Trp Gly Asn Xaa Xaa Xaa Xaa Xaa Thr
1               5                   10                  15



<210> 69
<211> 55
<212> DNA
<213> artificial sequence
<220>
<223> oligox-SDR2r
<220>
<221> misc_feature
<222> (33)..(47)
<223> any base
<220>
<221> misc_feature
<222> (33)..(47)
<223> any nucleotide
<400> 69

catggttcac ggagtagaga ccgaccccgt tgnnnnnnnn nnnnnnntga cgatc 55



<210> 70 

<211> 59 

<212> DNA 

<213> artificial sequence 
<220> 

<223> primer SDR1-mutnnb-forward
<220>
<221> misc_feature
<222> (24)..(40)
<223> N=A, C, G, T; B=C, G, T; V=A, C, G
<400> 70

tggtatccgc gggccactgc tacnnbnnbn nbnnbnnbnn baagtcccgg atccaggtg 59 



<210> 71
<211> 52
<212> DNA
<213> artificial sequence
<220>
<223> primer SDR2-mutnnb-reverse
<220>
<221> misc_feature
<222> (20)..(33)
<223> N=A, C, G, T; B=C, G, T; V


<400> 71 

ggcgccagag ctagcagtvn nvruivruivim 


<210> 72
<211> 6
<212> PRT
<213> artificial sequence
<220>
<223> variant g SDR1
<400> 72

Ala Phe Phe Asn Gly Asp
1               5



<210> 73
<211> 5
<212> PRT
<213> artificial sequence
<220>
<223> variant g SDR2
<400> 73

Arg Lys Asp Pro Trp
1               5



<210> 74
<211> 234
<212> PRT
<213> artificial sequence
<220>
<223> artificial sequence
<400> 74

Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
            20                  25                  30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Ala Ala Phe Asn Gly Lys
        35                  40                  45

Ser Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Glu Val Leu Glu
    50                  55                  60

Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro Gln
65                  70                  75                  80

Tyr Asp Arg Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys Leu Ser
                85                  90                  95

Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu Pro Thr
            100                 105                 110

Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly Asn
        115                 120                 125

Arg Lys Asp Phe Trp Thr Ala Ser Ser Gly Ala Asp Tyr Pro Asp Glu
    130                 135                 140

Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Ala Lys Cys Glu Ala
145                 150                 155                 160

Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe Leu
                165                 170                 175

Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val Val
            180                 185                 190

Cys Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly Asp Gly Cys Ala
        195                 200                 205

Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val Lys
    210                 215                 220

Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
225                 230



<210> 75 

<211> 234 

<212> PRT 

<213> artificial sequence 
<220> 

<223> artificial sequence 

<400> 75 

Ile Val Gly Gly Tyr Asn Cys Glu Glu Asn Ser Val Pro Tyr Gln Val
1               5                   10                  15

Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu Ile Asn Glu
            20                  25                  30

Gln Trp Val Val Ser Ala Gly His Cys Tyr Ala Ala Phe Asn Gly Lys
        35                  40                  45

Ser Arg Ile Gln Val Arg Leu Gly Glu His Asn Ile Gly Val Leu Glu
    50                  55                  60

Gly Asn Glu Gln Phe Ile Asn Ala Ala Lys Ile Ile Arg His Pro Gln
65                  70                  75                  80

Tyr Asp Trp Lys Thr Leu Asn Asn Asp Ile Met Leu Ile Lys Leu Ser
                85                  90                  95

Ser Arg Ala Val Ile Asn Ala Arg Val Ser Thr Ile Ser Leu Pro Thr
            100                 105                 110

Ala Pro Pro Ala Thr Gly Thr Lys Cys Leu Ile Ser Gly Trp Gly Asn
        115                 120                 125

Arg Lys Asp Phe Trp Thr Ala Ser Ser Gly Ala Asp Phe Pro Asp Glu
    130                 135                 140

Leu Gln Cys Leu Asp Ala Pro Val Leu Ser Gln Thr Lys Cys Glu Ala
145                 150                 155                 160

Ser Tyr Pro Gly Lys Ile Thr Ser Asn Met Phe Cys Val Gly Phe Leu
                165                 170                 175

Glu Gly Gly Lys Asp Ser Cys Gln Gly Asp Ser Gly Gly Pro Val Val
            180                 185                 190

Arg Asn Gly Gln Leu Gln Gly Val Val Ser Trp Gly Asp Gly Cys Ala
        195                 200                 205

Gln Lys Asn Lys Pro Gly Val Tyr Thr Lys Val Tyr Asn Tyr Val Lys
    210                 215                 220

Trp Ile Lys Asn Thr Ile Ala Ala Asn Ser
225                 230


<210> 76 

<211> 12 

<212> PRT 

<213> artificial sequence
<220> 

<223> substrate A 



<400> 76 

Leu Leu Trp Leu Gly Arg Val Val Gly Gly Pro Val
1               5                   10



<210> 77 

<211> 12

<212> PRT

<213> artificial sequence
<220>

<223> substrate B
<400> 77

Lys Lys Trp Leu Gly Arg Val Pro Gly Gly Pro Val
1               5                   10



<210> 78
<211> 6
<212> PRT
<213> artificial sequence
<220>
<223> variant 1 SDR1
<400> 78

Asp Ala Val Gly Arg Asp
1               5


<210> 79
<211> 6
<212> PRT
<213> artificial sequence
<220>
<223> variant 2 SDR1
<400> 79

Asn Gly Arg Asp Leu Glu
1               5


<210> 80
<211> 6
<212> PRT
<213> artificial sequence
<220>
<223> variant3 SDR1
<400> 80

Gly Phe Val Met Phe Asn
1               5


<210> 81
<211> 5
<212> PRT
<213> artificial sequence
<220>
<223> variant1 SDR2
<400> 81

Arg Val His Pro Ser
1               5


<210> 82
<211> 5
<212> PRT
<213> artificial sequence
<220>
<223> variant2 SDR2
<400> 82

Val Arg Gly Thr Trp
1               5

<210> 83 

<211> 5 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant3 SDR2 

<400> 83 

Arg Ser Pro Leu Thr 



<210> 84 

<211> 6 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant a SDR1 

<400> 84 

Arg Pro Trp Asp Pro Ser 



<210> 85 

<211> 6 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant b SDR1 

<400> 85 

Gly Phe Val Met Phe Asn 



<210> 86 

<211> 6 

<212> PRT 

<213> artificial sequence 
<220> 

<223> variant c SDR1 

<400> 86 

Glu Ile Ala Asn Arg Glu



<210> 87 

<211> 6 

<212> PRT 

<213> artificial sequence 
<220>

<223> variant d SDR1 



<400> 87 

Lys Ala Val Val Gly Thr 

<210> 88
<211> 6
<212> PRT
<213> artificial sequence
<220>
<223> variant e SDR1
<400> 88

Val Asn Ile Met Ala Ala


<210> 89
<211> 6
<212> PRT
<213> artificial sequence
<220>
<223> variant f SDR1
<400> 89

Ala Ala Phe Asn Gly Asp


<210> 90
<211> 5
<212> PRT
<213> artificial sequence
<220>
<223> variant a SDR2
<400> 90

Val His Pro Thr Ser
1               5


<210> 91
<211> 5
<212> PRT
<213> artificial sequence
<220>
<223> variant b SDR2
<400> 91

Arg Ser Pro Leu Thr


<210> 92
<211> 5
<212> PRT
<213> artificial sequence
<220>
<223> variant c SDR2
<400> 92

Arg Gly Ala Arg Thr
1               5

<210> 93
<211> 5
<212> PRT
<213> artificial sequence
<220>
<223> variant d SDR2
<400> 93

Arg Thr Pro Ile Ser
1               5


<210> 94
<211> 5
<212> PRT
<213> artificial sequence
<220>
<223> variant e SDR2
<400> 94

Thr Thr Ala Arg Lys
1               5


<210> 95
<211> 5
<212> PRT
<213> artificial sequence
<220>
<223> variant f SDR2
<400> 95

Arg Lys Asp Phe Trp
1               5


<210> 96
<211> 157
<212> PRT
<213> Homo sapiens
<400> 96



Val Arg Ser Ser Ser Arg Thr Pro Ser Asp Lys Pro Val Ala His Val
1               5                   10                  15

Val Ala Asn Pro Gln Ala Glu Gly Gln Leu Gln Trp Leu Asn Arg Arg
            20                  25                  30

Ala Asn Ala Leu Leu Ala Asn Gly Val Glu Leu Arg Asp Asn Gln Leu
        35                  40                  45

Val Val Pro Ser Glu Gly Leu Tyr Leu Ile Tyr Ser Gln Val Leu Phe
    50                  55                  60

Lys Gly Gln Gly Cys Pro Ser Thr His Val Leu Leu Thr His Thr Ile
65                  70                  75                  80

Ser Arg Ile Ala Val Ser Tyr Gln Thr Lys Val Asn Leu Leu Ser Ala
                85                  90                  95

Ile Lys Ser Pro Cys Gln Arg Glu Thr Pro Glu Gly Ala Glu Ala Lys
            100                 105                 110

Pro Trp Tyr Glu Pro Ile Tyr Leu Gly Gly Val Phe Gln Leu Glu Lys
        115                 120                 125

Gly Asp Arg Leu Ser Ala Glu Ile Asn Arg Pro Asp Tyr Leu Leu Phe
    130                 135                 140

Ala Glu Ser Gly Gln Val Tyr Phe Gly Ile Ile Ala Leu
145                 150                 155
<210> 97

<211> 306 

<212> PRT 

<213> Homo sapiens 



<400> 97 



Ile


Trp




Leu Lys 


Lys 


Asp 


Val 


Tyr 


Val


Val 


Glu


Leu Asp 


Trp 


Tyr 


1 








5 










10 










15 




Pro 


Asp 


Ala


Pro Gly Glu Met 


Val 


Val 


Leu 


Thr 


Cys 


Asp 


Thr 


Pro 


Glu 








20 










25 










30 






Glu


Asp 


Gly 


He 


Thr 


Trp 


Thr 


Leu 


Asp 


Gin 


Ser 


Ser 


Glu 


Val 


Leu 


Gly 






35 










40 










45 








Ser 


Gly 


Lys 


Thr 


Leu 


Thr 


He 


Gin 


Val 


Lys 


Glu 


Phe 


Gly Asp Ala 


Gly 




50 










55 










60 










Gin 


Tyr 


Thr 


Cys 


His 


Lys 


Gly Gly Glu 


Val 


Leu 


Ser 


His 


Ser 


Leu 


Leu 


65 










70 










75 










80 


Leu 


Leu 


His 


Lys 


Lys 


Glu 


Asp 


Gly 


He 


Trp 


Ser 


Thr 


Asp 


lie 


Leu 


Lys 










85 










90 










95 




Asp 


Gin 


Lys 


Glu 


Pro 


Lys 


Asn 


Lys 


Thr 


Phe 


Leu 


Arg 


Cys 


Glu 


Ala 


Lys 








100 










105 










110 






Asn 


Tyr 


Ser 


Gly 


Arg 


Phe 


Thr 


Cys 


Trp 


Trp 


Leu 


Thr 


Thr 


He 


Ser 


Thr 






115 










120 










125 








Asp 


Leu 


Thr 


Phe 


Ser 


Val 


Lys 


Ser 


Ser 


Arg 


Gly 


Ser 


Ser 


Asp 


Pro 


Gin 




130 










135 










140 










Gly 


Val 


Thr 


Cys 


Gly Ala 


Ala 


Thr 


Leu 


Ser 


Ala 


Glu 


Arg 


Val 


Arg 


Gly 


145 










150 










155 










160 


Asp 


Asn 


Lys 


Glu 


Tyr 


Glu 


Tyr 


Ser 


Val 


Glu 


Cys 


Gin 


Glu 


Asp 


Ser 


Ala 










165 










170 










175 




Cys 


Pro 


Ala 


Ala 


Glu 


Glu 


Ser 


Leu 


Pro 


He 


Glu 


Val 


Met 


Val 


Asp 


Ala 








180 










185 










190 






Val 


His 


Lys 


Leu 


Lys 


Tyr 


Glu 


Asn Tyr 


Thr 


Ser 


Ser 


Phe 


Phe 


He 


Arg 






195 










200 










205 








Asp 


He 


He 


Lys 


Pro 


Asp 


Pro 


Pro 


Lys 


Asn 


Leu 


Gin 


Leu 


Lys 


Pro 


Leu 




210 










215 










220 










Lys 


Asn 


Ser 


Arg 


Gin 


Val 


Glu 


Val 


Ser 


Trp 


Glu 


Tyr 


Pro 


Asp 


Thr 


Trp 


225 










230 










235 










240 


Ser 


Thr 


Pro 


His 


Ser 


Tyr 


Phe 


Ser 


Leu 


Thr 


Phe 


Cys 


Val 


Gin 


Val 


Gin 










245 










250 










255 




Gly 


Lys 


Ser 


Lys 


Arg 


Glu 


Lys 


Lys 


Asp 


Arg 


Val 


Phe 


Thr 


Asp 


Lys 


Thr 








260 










265 










270 






Ser 


Ala 


Thr 


Val 


He 


Cys 


Arg 


Lys 


Asn 


Ala 


Ser 


He 


Ser 


Val 


Arg 


Ala 






275 










280 










285 








Gin 


Asp 


Arg 


Tyr 


Tyr 


Ser 


Ser 


Ser 


Trp 


Ser 


Glu 


Trp 


Ala 


Ser 


Val 


Pro 



290 295 300 



Cys Ser 
305 



<210> 98 

<211> 157 

<212> PRT 

<213> Homo sapiens 



<400> 98 

Tyr Phe Gly Lys Leu Glu Ser Lys Leu Ser Val Ile Arg Asn Leu Asn
1               5                   10                  15

Asp Gln Val Leu Phe Ile Asp Gln Gly Asn Arg Pro Leu Phe Glu Asp
            20                  25                  30

Met Thr Asp Ser Asp Cys Arg Asp Asn Ala Pro Arg Thr Ile Phe Ile
        35                  40                  45

Ile Ser Met Tyr Lys Asp Ser Gln Pro Arg Gly Met Ala Val Thr Ile
    50                  55                  60

Ser Val Lys Cys Glu Lys Ile Ser Thr Leu Ser Cys Glu Asn Lys Ile
65                  70                  75                  80

Ile Ser Phe Lys Glu Met Asn Pro Pro Asp Asn Ile Lys Asp Thr Lys
                85                  90                  95

Ser Asp Ile Ile Phe Phe Gln Arg Ser Val Pro Gly His Asp Asn Lys
            100                 105                 110

Met Gln Phe Glu Ser Ser Ser Tyr Glu Gly Tyr Phe Leu Ala Cys Glu
        115                 120                 125

Lys Glu Arg Asp Leu Phe Lys Leu Ile Leu Lys Lys Glu Asp Glu Leu
    130                 135                 140

Gly Asp Arg Ser Ile Met Phe Thr Val Gln Asn Glu Asp
145                 150                 155



<210> 99 

<211> 133 

<212> PRT 

<213> Homo sapiens 



<400> 99 

Ala Pro Thr Ser Ser Ser Thr Lys Lys Thr Gln Leu Gln Leu Glu His
1               5                   10                  15

Leu Leu Leu Asp Leu Gln Met Ile Leu Asn Gly Ile Asn Asn Tyr Lys
            20                  25                  30

Asn Pro Lys Leu Thr Arg Met Leu Thr Phe Lys Phe Tyr Met Pro Lys
        35                  40                  45

Lys Ala Thr Glu Leu Lys His Leu Gln Cys Leu Glu Glu Glu Leu Lys
    50                  55                  60

Pro Leu Glu Glu Val Leu Asn Leu Ala Gln Ser Lys Asn Phe His Leu
65                  70                  75                  80

Arg Pro Arg Asp Leu Ile Ser Asn Ile Asn Val Ile Val Leu Glu Leu
                85                  90                  95

Lys Gly Ser Glu Thr Thr Phe Met Cys Glu Tyr Ala Asp Glu Thr Ala
            100                 105                 110

Thr Ile Val Glu Phe Leu Asn Arg Trp Ile Thr Phe Cys Gln Ser Ile
        115                 120                 125

Ile Ser Thr Leu Thr
    130



<210> 100 

<211> 72 

<212> PRT 

<213> Homo sapiens 



<400> 100 

Ser Ala Lys Glu Leu Arg Cys Gln Cys Ile Lys Thr Tyr Ser Lys Pro
1               5                   10                  15

Phe His Pro Lys Phe Ile Lys Glu Leu Arg Val Ile Glu Ser Gly Pro
            20                  25                  30

His Cys Ala Asn Thr Glu Ile Ile Val Lys Leu Ser Asp Gly Arg Glu
        35                  40                  45

Leu Cys Leu Asp Pro Lys Glu Asn Trp Val Gln Arg Val Val Glu Lys
    50                  55                  60

Phe Leu Lys Arg Ala Glu Asn Ser
65                  70



<210> 101 

<211> 74 

<212> PRT 

<213> Homo sapiens 
<400> 101

Gly Pro Ala Ser Val Pro Thr Thr Cys Cys Phe Asn Leu Ala Asn Arg
1               5                   10                  15

Lys Ile Pro Leu Gln Arg Leu Glu Ser Tyr Arg Arg Ile Thr Ser Gly
            20                  25                  30

Lys Cys Pro Gln Lys Ala Val Ile Phe Lys Thr Lys Leu Ala Lys Asp
        35                  40                  45

Ile Cys Ala Asp Pro Lys Lys Lys Trp Val Gln Asp Ser Met Lys Tyr
    50                  55                  60

Leu Asp Gln Lys Ser Pro Thr Pro Lys Pro
65                  70



<210> 102 

<211> 76 

<212> PRT 

<213> Homo sapiens 

<400> 102 

Gln Pro Asp Ala Ile Asn Ala Pro Val Thr Cys Cys Tyr Asn Phe Thr
1               5                   10                  15

Asn Arg Lys Ile Ser Val Gln Arg Leu Ala Ser Tyr Arg Arg Ile Thr
            20                  25                  30

Ser Ser Lys Cys Pro Lys Glu Ala Val Ile Phe Lys Thr Ile Val Ala
        35                  40                  45

Lys Glu Ile Cys Ala Asp Pro Lys Gln Lys Trp Val Gln Asp Ser Met
    50                  55                  60

Asp His Leu Asp Lys Gln Thr Gln Thr Pro Lys Thr
65                  70                  75



<210> 103 

<211> 206 

<212> PRT 

<213> Homo sapiens 



<400> 103 



Ala Pro Met Ala Glu Gly Gly Gly Gln Asn His His Glu Val Val Lys
1               5                   10                  15

Phe Met Asp Val Tyr Gln Arg Ser Tyr Cys His Pro Ile Glu Thr Leu
            20                  25                  30

Val Asp Ile Phe Gln Glu Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys
        35                  40                  45

Pro Ser Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu
    50                  55                  60

Gly Leu Glu Cys Val Pro Thr Glu Glu Ser Asn Ile Thr Met Gln Ile
65                  70                  75                  80

Met Arg Ile Lys Pro His Gln Gly Gln His Ile Gly Glu Met Ser Phe
                85                  90                  95

Leu Gln His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg
            100                 105                 110

Gln Glu Lys Lys Ser Val Arg Gly Lys Gly Lys Gly Gln Lys Arg Lys
        115                 120                 125

Arg Lys Lys Ser Arg Tyr Lys Ser Trp Ser Val Tyr Val Gly Ala Arg
    130                 135                 140

Cys Cys Leu Met Pro Trp Ser Leu Pro Gly Pro His Pro Cys Gly Pro
145                 150                 155                 160

Cys Ser Glu Arg Arg Lys His Leu Phe Val Gln Asp Pro Gln Thr Cys
                165                 170                 175

Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys Lys Ala Arg Gln Leu
            180                 185                 190

Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg
        195                 200                 205

<210> 104

<211> 112 

<212> PRT 

<213> Homo sapiens 

<400> 104 



Ala Leu Asp Thr Asn Tyr Cys Phe Ser Ser Thr Glu Lys Asn Cys Cys
1               5                   10                  15

Val Arg Gln Leu Tyr Ile Asp Phe Arg Lys Asp Leu Gly Trp Lys Trp
            20                  25                  30

Ile His Glu Pro Lys Gly Tyr His Ala Asn Phe Cys Leu Gly Pro Cys
        35                  40                  45

Pro Tyr Ile Trp Ser Leu Asp Thr Gln Tyr Ser Lys Val Leu Ala Leu
    50                  55                  60

Tyr Asn Gln His Asn Pro Gly Ala Ser Ala Ala Pro Cys Cys Val Pro
65                  70                  75                  80

Gln Ala Leu Glu Pro Leu Pro Ile Val Tyr Tyr Val Gly Arg Lys Pro
                85                  90                  95

Lys Val Glu Gln Leu Ser Asn Met Ile Val Arg Ser Cys Lys Cys Ser
            100                 105                 110



<210> 105 

<211> 30 

<212> PRT 

<213> Homo sapiens 

<400> 105 

Phe Val Asn Gln His Leu Cys Gly Ser His Leu Val Glu Ala Leu Tyr
1               5                   10                  15

Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Thr Pro Lys Thr
            20                  25                  30



<210> 106 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 106 

Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cys Ser Leu Tyr Gln Leu
1               5                   10                  15

Glu Asn Tyr Cys Asn
            20



<210> 107 

<211> 28 

<212> PRT 

<213> Homo sapiens 

<400> 107 

Gly Ser Ser Phe Leu Ser Pro Glu His Gln Arg Val Gln Gln Arg Lys
1               5                   10                  15

Glu Ser Lys Lys Pro Pro Ala Lys Leu Gln Pro Arg
            20                  25



<210> 108 
<211> 9 
<212> PRT

<213> Homo sapiens 

<400> 108 

Arg Val Tyr Ile His Pro Phe His Leu
1 5 



<210> 109 

<211> 114 

<212> PRT 

<213> Homo sapiens 

<400> 109 

Pro Met Phe Ile Val Asn Thr Asn Val Pro Arg Ala Ser Val Pro Asp
1               5                   10                  15

Gly Phe Leu Ser Glu Leu Thr Gln Gln Leu Ala Gln Ala Thr Gly Lys
            20                  25                  30

Pro Pro Gln Tyr Ile Ala Val His Val Val Pro Asp Gln Leu Met Ala
        35                  40                  45

Phe Gly Gly Ser Ser Glu Pro Cys Ala Leu Cys Ser Leu His Ser Ile
    50                  55                  60

Gly Lys Ile Gly Gly Ala Gln Asn Arg Ser Tyr Ser Lys Leu Leu Cys
65                  70                  75                  80

Gly Leu Leu Ala Glu Arg Leu Arg Ile Ser Pro Asp Arg Val Tyr Ile
                85                  90                  95

Asn Tyr Tyr Asp Met Asn Ala Ala Asn Val Gly Trp Asn Asn Ser Thr
            100                 105                 110

Phe Ala



<210> 110 

<211> 425 

<212> PRT 

<213> Homo sapiens 

<400> 110 



Met Gly Pro Arg Arg Leu Leu Leu Val Ala Ala Cys Phe Ser Leu Cys
1 5 10 15

Gly Pro Leu Leu Ser Ala Arg Thr Arg Ala Arg Arg Pro Glu Ser Lys
20 25 30

Ala Thr Asn Ala Thr Leu Asp Pro Arg Ser Phe Leu Leu Arg Asn Pro
35 40 45

Asn Asp Lys Tyr Glu Pro Phe Trp Glu Asp Glu Glu Lys Asn Glu Ser
50 55 60

Gly Leu Thr Glu Tyr Arg Leu Val Ser Ile Asn Lys Ser Ser Pro Leu
65 70 75 80

Gln Lys Gln Leu Pro Ala Phe Ile Ser Glu Asp Ala Ser Gly Tyr Leu
85 90 95

Thr Ser Ser Trp Leu Thr Leu Phe Val Pro Ser Val Tyr Thr Gly Val
100 105 110

Phe Val Val Ser Leu Pro Leu Asn Ile Met Ala Ile Val Val Phe Ile
115 120 125

Leu Lys Met Lys Val Lys Lys Pro Ala Val Val Tyr Met Leu His Leu
130 135 140

Ala Thr Ala Asp Val Leu Phe Val Ser Val Leu Pro Phe Lys Ile Ser
145 150 155 160

Tyr Tyr Phe Ser Gly Ser Asp Trp Gln Phe Gly Ser Glu Leu Cys Arg
165 170 175

Phe Val Thr Ala Ala Phe Tyr Cys Asn Met Tyr Ala Ser Ile Leu Leu
180 185 190

Met Thr Val Ile Ser Ile Asp Arg Phe Leu Ala Val Val Tyr Pro Met
195 200 205

Gln Ser Leu Ser Trp Arg Thr Leu Gly Arg Ala Ser Phe Thr Cys Leu
210 215 220

Ala Ile Trp Ala Leu Ala Ile Ala Gly Val Val Pro Leu Leu Leu Lys
225 230 235 240

Glu Gln Thr Ile Gln Val Pro Gly Leu Asn Ile Thr Thr Cys His Asp
245 250 255

Val Leu Asn Glu Thr Leu Leu Glu Gly Tyr Tyr Ala Tyr Tyr Phe Ser
260 265 270

Ala Phe Ser Ala Val Phe Phe Phe Val Pro Leu Ile Ile Ser Thr Val
275 280 285

Cys Tyr Val Ser Ile Ile Arg Cys Leu Ser Ser Ser Ala Val Ala Asn
290 295 300

Arg Ser Lys Lys Ser Arg Ala Leu Phe Leu Ser Ala Ala Val Phe Cys
305 310 315 320

Ile Phe Ile Ile Cys Phe Gly Pro Thr Asn Val Leu Leu Ile Ala His
325 330 335

Tyr Ser Phe Leu Ser His Thr Ser Thr Thr Glu Ala Ala Tyr Phe Ala
340 345 350

Tyr Leu Leu Cys Val Cys Val Ser Ser Ile Ser Cys Cys Ile Asp Pro
355 360 365

Leu Ile Tyr Tyr Tyr Ala Ser Ser Glu Cys Gln Arg Tyr Val Tyr Ser
370 375 380

Ile Leu Cys Cys Lys Glu Ser Ser Asp Pro Ser Ser Tyr Asn Ser Ser
385 390 395 400

Gly Gln Leu Met Ala Ser Lys Met Asp Thr Cys Ser Ser Asn Leu Asn
405 410 415

Asn Ser Ile Tyr Lys Lys Leu Leu Thr
420 425
















<210> 111

<211> 397

<212> PRT

<213> Homo sapiens

<400> 111




























Met Arg Ser Pro Ser Ala Ala Trp Leu Leu Gly Ala Ala Ile Leu Leu
1 5 10 15

Ala Ala Ser Leu Ser Cys Ser Gly Thr Ile Gln Gly Thr Asn Arg Ser
20 25 30

Ser Lys Gly Arg Ser Leu Ile Gly Lys Val Asp Gly Thr Ser His Val
35 40 45

Thr Gly Lys Gly Val Thr Val Glu Thr Val Phe Ser Val Asp Glu Phe
50 55 60

Ser Ala Ser Val Leu Thr Gly Lys Leu Thr Thr Val Phe Leu Pro Ile
65 70 75 80

Val Tyr Thr Ile Val Phe Val Val Gly Leu Pro Ser Asn Gly Met Ala
85 90 95

Leu Trp Val Phe Leu Phe Arg Thr Lys Lys Lys His Pro Ala Val Ile
100 105 110

Tyr Met Ala Asn Leu Ala Leu Ala Asp Leu Leu Ser Val Ile Trp Phe
115 120 125

Pro Leu Lys Ile Ala Tyr His Ile His Gly Asn Asn Trp Ile Tyr Gly
130 135 140

Glu Ala Leu Cys Asn Val Leu Ile Gly Phe Phe Tyr Gly Asn Met Tyr
145 150 155 160

Cys Ser Ile Leu Phe Met Thr Cys Leu Ser Val Gln Arg Tyr Trp Val
165 170 175

Ile Val Asn Pro Met Gly His Ser Arg Lys Lys Ala Asn Ile Ala Ile
180 185 190

Gly Ile Ser Leu Ala Ile Trp Leu Leu Ile Leu Leu Val Thr Ile Pro
195 200 205
Leu


Tvr 


Val 


Val 


Lvs 


Gin 


Thr 


lie 


Phe 


He 


Pro 


Ala 


Leu 


Asn 


He 


J. 11 IS 




210 










2 15 

*G -I. o 










220 












Pvs 


His 


Asp 


Va3 






V3J. U 


Gin 


Leu 


Leu 


Val 


Gly Asp Met 


Fne 


225 










230 










235 










o a. n 




Tvr 

Jr 


Phe 


Leu 


CC1 




Al a 


Tl ^ 


Gly Val 


Phe 


Leu 


Phe 


Pro 


Til , 

Ala 


PJtie 










245 

M ~J 










250 










t—\ r— r— 

255 






Th-r 


Ala 


Ser 


Al a 


xyr 


v ct JL 


iJcU 


Met 


He 


Arg Met 


Leu Arg 


Ser 










260 










265 










270 






Al a 


rlC 


Asp 


Glu 


A en 


O 62 X. 


f2l ii 




Lys 


Arg 


Lys 


Arg 


Ala 


He 


Lys 


Leu 






275 




















285 
645 650 655

Arg Gln Leu Thr Val Leu Glu Arg Val Ala Pro Thr Ile Thr Gly Asn

660 665 670

Leu Glu Asn Gln Thr Thr Ser Ile Gly Glu Ser Ile Glu Val Ser Cys

675 680 685

Thr Ala Ser Gly Asn Pro Pro Pro Gln Ile Met Trp Phe Lys Asp Asn

690 695 700

Glu Thr Leu Val Glu Asp Ser Gly Ile Val Leu Lys Asp Gly Asn Arg
705 710 715 720

Asn Leu Thr Ile Arg Arg Val Arg Lys Glu Asp Glu Gly Leu Tyr Thr

725 730 735

Cys Gln Ala Cys Ser Val Leu Gly Cys Ala Lys Val Glu Ala Phe Phe

740 745 750

Ile Ile Glu Gly Ala Gln Glu Lys Thr Asn Leu Glu Ile Ile Ile Leu

755 760 765

Val Gly Thr Ala Val Ile Ala Met Phe Phe Trp Leu Leu Leu Val Ile

770 775 780

Ile Leu Arg Thr Val Lys Arg Ala Asn Gly Gly Glu Leu Lys Thr Gly
785 790 795 800

Tyr Leu Ser Ile Val Met Asp Pro Asp Glu Leu Pro Leu Asp Glu His

805 810 815

Cys Glu Arg Leu Pro Tyr Asp Ala Ser Lys Trp Glu Phe Pro Arg Asp

820 825 830

Arg Leu Lys Leu Gly Lys Pro Leu Gly Arg Gly Ala Phe Gly Gln Val

835 840 845

Ile Glu Ala Asp Ala Phe Gly Ile Asp Lys Thr Ala Thr Cys Arg Thr

850 855 860

Val Ala Val Lys Met Leu Lys Glu Gly Ala Thr His Ser Glu His Arg
865 870 875 880

Ala Leu Met Ser Glu Leu Lys Ile Leu Ile His Ile Gly His His Leu

885 890 895

Asn Val Val Asn Leu Leu Gly Ala Cys Thr Lys Pro Gly Gly Pro Leu

900 905 910

Met Val Ile Val Glu Phe Cys Lys Phe Gly Asn Leu Ser Thr Tyr Leu

915 920 925

Arg Ser Lys Arg Asn Glu Phe Val Pro Tyr Lys Thr Lys Gly Ala Arg

930 935 940

Phe Arg Gln Gly Lys Asp Tyr Val Gly Ala Ile Pro Val Asp Leu Lys
945 950 955 960

Arg Arg Leu Asp Ser Ile Thr Ser Ser Gln Ser Ser Ala Ser Ser Gly

965 970 975

Phe Val Glu Glu Lys Ser Leu Ser Asp Val Glu Glu Glu Glu Ala Pro

980 985 990

Glu Asp Leu Tyr Lys Asp Phe Leu Thr Leu Glu His Leu Ile Cys Tyr

995 1000 1005

Ser Phe Gln Val Ala Lys Gly Met Glu Phe Leu Ala Ser Arg Lys

1010 1015 1020

Cys Ile His Arg Asp Leu Ala Ala Arg Asn Ile Leu Leu Ser Glu

1025 1030 1035

Lys Asn Val Val Lys Ile Cys Asp Phe Gly Leu Ala Arg Asp Ile

1040 1045 1050

Tyr Lys Asp Pro Asp Tyr Val Arg Lys Gly Asp Ala Arg Leu Pro

1055 1060 1065

Leu Lys Trp Met Ala Pro Glu Thr Ile Phe Asp Arg Val Tyr Thr

1070 1075 1080

Ile Gln Ser Asp Val Trp Ser Phe Gly Val Leu Leu Trp Glu Ile

1085 1090 1095

Phe Ser Leu Gly Ala Ser Pro Tyr Pro Gly Val Lys Ile Asp Glu

1100 1105 1110

Glu Phe Cys Arg Arg Leu Lys Glu Gly Thr Arg Met Arg Ala Pro

1115 1120 1125

Asp Tyr Thr Thr Pro Glu Met Tyr Gln Thr Met Leu Asp Cys Trp
1130 1135 1140
His Gly Glu Pro Ser Gln Arg Pro Thr Phe Ser Glu Leu Val Glu

1145 1150 1155

His Leu Gly Asn Leu Leu Gln Ala Asn Ala Gln Gln Asp Gly Lys

1160 1165 1170

Asp Tyr Ile Val Leu Pro Ile Ser Glu Thr Leu Ser Met Glu Glu

1175 1180 1185

Asp Ser Gly Leu Ser Leu Pro Thr Ser Pro Val Ser Cys Met Glu

1190 1195 1200

Glu Glu Glu Val Cys Asp Pro Lys Phe His Tyr Asp Asn Thr Ala

1205 1210 1215

Gly Ile Ser Gln Tyr Leu Gln Asn Ser Lys Arg Lys Ser Arg Pro
Val Ser Val Lys Thr Phe Glu Asp Ile Pro Leu Glu Glu Pro Glu
Leu Ala Ser Glu Glu Leu Lys Thr Leu Glu Asp Arg Thr Lys Leu
Ser Pro Ser Phe Gly Gly Met Val Pro Ser Lys Ser Arg Glu Ser
Val Ala Ser Glu Gly Ser Asn Gln Thr Ser Gly Tyr Gln Ser Gly

1295 1300 1305

Tyr His Ser Asp Asp Thr Asp Thr Thr Val Tyr Ser Ser Glu Glu

1310 1315 1320

Ala Glu Leu Leu Lys Leu Ile Glu Ile Gly Val Gln Thr Gly Ser

1325 1330 1335

Thr Ala Gln Ile Leu Gln Pro Asp Ser Gly Thr Thr Leu Ser Ser

1340 1345 1350

Pro Pro Val

1355
Leu Val Cys Arg
Arg Asn Tyr Val
Leu


Met 


Thr 
885 


Phe 


Gly 


Ser 


Lys 


Pro 
890 


Tyr 


Asp 


Gly 


He 


Pro 
895 


Ala 


Ser 


Glu 


He 


Ser 
900 


Ser 


He 


Leu 


Glu 


Lys 
905 


Gly 


Glu 


Arg 


Leu 


Pro 
910 


Gin 


Pro 


Pro 


lie 


Cys 
915 


Thr 


He 


Asp 


Val 


Tyr 
920 


Met 


He 


Met 


Val 


Lys 
925 


Cys 


Trp 


Met 


lie 


Asp 
930 


Ala 


Asp 


Ser 


Arg 


Pro 
935 


Lys 


Phe 


Arg 


Glu 


Leu 
940 


He 


He 


Glu 


Phe 


Ser 


Lys 


Met 


Ala 


Arg 


Asp 


Pro 


Gin 


Arg 


Tyr 


Leu 


Val 


He 


Gin 


Gly 


Asp 


945 










950 










955 










960 


Glu 


Arg 


Met 


His 


Leu 
965 


Pro 


Ser 


Pro 


Thr 


Asp 
970 


Ser 


Asn 


Phe 


Tyr 


Arg 
975 


Ala 


Leu 


Met 


Asp 


Glu 
980 


Glu 


Asp 


Met 


Asp 


Asp 
985 


Val 


Val 


Asp 


Ala 


Asp 
990 


Glu 


Tyr 



Leu He Pro Gin Gin Gly Phe Phe Ser Ser Pro Ser Thr Ser Arg Thr 
995 1000 1005 



Pro 


Leu 
1010 


Leu 


Ser 


Ser 


Leu 


Ser 
1015 


Ala 


Thr 


Ser 


Asn 


Asn 
1020 


Ser 


Thr 


val 


Ala 


Cys 
1025 


He 


Asp 


Arg 


Asn 


Gly 
1030 


Leu 


Gin 


Ser 


Cys 


Pro 
1035 


He 


Lys 


Glu 


Asp 

* 


Ser 


Phe 


Leu 


Gin 


Arg 


Tyr 


Ser 


Ser 


Asp 


Pro 


Thr 


Gly 


Ala 


Leu 




1040 










1045 










1050 








Thr 


Glu 


Asp 


Ser 


He 


Asp 


Asp 


Thr 


Phe 


Leu 


Pro 


Val 


Pro 


Glu 


Tyr 




1055 










1060 










1065 






He 


Asn 
1070 


Gin 


Ser 


Val 


Pro 


Lys 
1075 


Arg 


Pro 


Ala 


Gly 


Ser 
1080 


Val 


Gin 


Asn 


Pro 


Val 
1085 


Tyr 


His 


Asn 


Gin 


Pro 
1090 


Leu 


Asn 


Pro 


Ala 


Pro 
1095 


Ser 


Arg 


Asp 


Pro 


His 
1100 


Tyr 


Gin 


Asp 


Pro 


His 
1105 


Ser 


Thr 


Ala 


Val 


Gly 
1110 


Asn 


Pro 


Glu 


Tyr 


Leu 


Asn 


Thr 


Val 


Gin 


Pro 


Thr 


Cys 


Val 


Asn 


Ser 


Thr 


Phe 


Asp 




1115 










1120 










1125 






Ser 


Pro 
1130 


Ala 


His 


Trp 


Ala 


Gin 
1135 


Lys 


Gly 


Ser 


Hi3 


Gin 
1140 


He 


Ser 


Leu 


Asp 


Asn 


Pro 


Asp 


Tyr 


Gin 


Gin 


Asp 


Phe 


Phe 


Pro 


Lys 


Glu 


Ala 


Lys 




1145 










1150 










1155 






Pro 


Asn 


Gly 


He 


Phe 


Lys 


Gly 


Ser 


Thr 


Ala 


Glu 


Asn 


Ala 


Glu 


Tyr 




1160 










1165 










1170 






Leu 


Arg 
117S 


Val 


Ala 


Pro 


Gin 


Ser 
1180 


Ser 


Glu 


Phe 


He 


Gly 
1185 


Ala 
<210> 117

<211> 422 

<212> PRT 

<213> Homo sapiens 



<400> 117



Met Asp Val Leu Ser Pro Gly Gln Gly Asn Asn Thr Thr Ser Pro Pro
1 5 10 15

Ala Pro Phe Glu Thr Gly Gly Asn Thr Thr Gly Ile Ser Asp Val Thr
20 25 30

Val Ser Tyr Gln Val Ile Thr Ser Leu Leu Leu Gly Thr Leu Ile Phe
35 40 45

Cys Ala Val Leu Gly Asn Ala Cys Val Val Ala Ala Ile Ala Leu Glu
50 55 60

Arg Ser Leu Gln Asn Val Ala Asn Tyr Leu Ile Gly Ser Leu Ala Val
65 70 75 80

Thr Asp Leu Met Val Ser Val Leu Val Leu Pro Met Ala Ala Leu Tyr
85 90 95

Gln Val Leu Asn Lys Trp Thr Leu Gly Gln Val Thr Cys Asp Leu Phe
100 105 110

Ile Ala Leu Asp Val Leu Cys Cys Thr Ser Ser Ile Leu His Leu Cys
115 120 125

Ala Ile Ala Leu Asp Arg Tyr Trp Ala Ile Thr Asp Pro Ile Asp Tyr
130 135 140

Val Asn Lys Arg Thr Pro Arg Arg Ala Ala Ala Leu Ile Ser Leu Thr
145 150 155 160

Trp Leu Ile Gly Phe Leu Ile Ser Ile Pro Pro Met Leu Gly Trp Arg
165 170 175

Thr Pro Glu Asp Arg Ser Asp Pro Asp Ala Cys Thr Ile Ser Lys Asp
180 185 190

His Gly Tyr Thr Ile Tyr Ser Thr Phe Gly Ala Phe Tyr Ile Pro Leu
195 200 205

Leu Leu Met Leu Val Leu Tyr Gly Arg Ile Phe Arg Ala Ala Arg Phe
210 215 220

Arg Ile Arg Lys Thr Val Lys Lys Val Glu Lys Thr Gly Ala Asp Thr
225 230 235 240

Arg His Gly Ala Ser Pro Ala Pro Gln Pro Lys Lys Ser Val Asn Gly
245 250 255

Glu Ser Gly Ser Arg Asn Trp Arg Leu Gly Val Glu Ser Lys Ala Gly
260 265 270

Gly Ala Leu Cys Ala Asn Gly Ala Val Arg Gln Gly Asp Asp Gly Ala
275 280 285

Ala Leu Glu Val Ile Glu Val His Arg Val Gly Asn Ser Lys Glu His
290 295 300

Leu Pro Leu Pro Ser Glu Ala Gly Pro Thr Pro Cys Ala Pro Ala Ser
305 310 315 320

Phe Glu Arg Lys Asn Glu Arg Asn Ala Glu Ala Lys Arg Lys Met Ala
325 330 335

Leu Ala Arg Glu Arg Lys Thr Val Lys Thr Leu Gly Ile Ile Met Gly
340 345 350

Thr Phe Ile Leu Cys Trp Leu Pro Phe Phe Ile Val Ala Leu Val Leu
355 360 365

Pro Phe Cys Glu Ser Ser Cys His Met Pro Thr Leu Leu Gly Ala Ile
370 375 380

Ile Asn Trp Leu Gly Tyr Ser Asn Ser Leu Leu Asn Pro Val Ile Tyr
385 390 395 400

Ala Tyr Phe Asn Lys Asp Phe Gln Asn Ala Phe Lys Lys Ile Ile Lys
405 410 415

Cys Lys Phe Cys Arg Gln
420
<210> 118

<211> 129 

<212> PRT 

<213> Homo sapiens 



<400> 118 

His Lys Cys Asp Ile Thr Leu Gln Glu Ile Ile Lys Thr Leu Asn Ser
1 5 10 15

Leu Thr Glu Gln Lys Thr Leu Cys Thr Glu Leu Thr Val Thr Asp Ile
20 25 30

Phe Ala Ala Ser Lys Asn Thr Thr Glu Lys Glu Thr Phe Cys Arg Ala
35 40 45

Ala Thr Val Leu Arg Gln Phe Tyr Ser His His Glu Lys Asp Thr Arg
50 55 60

Cys Leu Gly Ala Thr Ala Gln Gln Phe His Arg His Lys Gln Leu Ile
65 70 75 80

Arg Phe Leu Lys Arg Leu Asp Arg Asn Leu Trp Gly Leu Ala Gly Leu
85 90 95

Asn Ser Cys Pro Val Lys Glu Ala Asn Gln Ser Thr Leu Glu Asn Phe
100 105 110

Leu Glu Arg Leu Lys Thr Ile Met Arg Glu Lys Tyr Ser Lys Cys Ser
115 120 125

Ser



<210> 119

<211> 113

<212> PRT

<213> Homo sapiens

<400> 119




























Met Gly Pro Val Pro Pro Ser Thr Ala Leu Arg Glu Leu Ile Glu Glu
1 5 10 15

Leu Val Asn Ile Thr Gln Asn Gln Lys Ala Pro Leu Cys Asn Gly Ser
20 25 30

Met Val Trp Ser Ile Asn Leu Thr Ala Gly Met Tyr Cys Ala Ala Leu
35 40 45

Glu Ser Leu Ile Asn Val Ser Gly Cys Ser Ala Ile Glu Lys Thr Gln
50 55 60

Arg Met Leu Ser Gly Phe Cys Pro His Lys Val Ser Ala Gly Gln Phe
65 70 75 80

Ser Ser Leu His Val Arg Asp Thr Lys Ile Glu Val Ala Gln Phe Val
85 90 95

Lys Asp Leu Leu Leu His Leu Lys Lys Leu Phe Arg Glu Gly Arg Phe
100 105 110

Asn































<210> 120 

<211> 726 

<212> PRT 

<213> Homo sapiens 

<400> 120 

Val Thr Lys Leu Leu Pro Ala Leu Leu Leu Gln His Val Leu Leu His
1 5 10 15

Leu Leu Leu Leu Pro Ile Ala Ile Pro Tyr Ala Glu Gly Gln Arg Lys
20 25 30

Arg Arg Asn Thr Ile His Glu Phe Lys Lys Ser Ala Lys Thr Thr Leu
35 40 45

Ile Lys Ile Asp Pro Ala Leu Lys Ile Lys Thr Lys Lys Val Asn Thr
50 55 60

Ala Asp Gln Cys Ala Asn Arg Cys Thr Arg Asn Lys Gly Leu Pro Phe
65










70 








Thr 


Cys 


Lye 


Ala 


Phe 


Val 


Phe 


Asp 


Lys 










85 










Phe 


Pro 


Phe 


Asn 


Ser 


Met 


Ser 


Ser 


Gly 








100 










105 


Glu 


Phe 


Asp 


Leu 


Tyr 


Glu 


Asn 


Lys 


Asp 






115 










120 




Gly 


Lys 


Gly Arg 


Ser 


Tyr 


Lys 


Gly Thr 




130 










135 






lie 


Lys 


Cys 


Gin 


Pro 


Trp 


Ser 


Ser 


Met 


145 










150 








Leu 


Pro 


Ser 


Ser 


Tyr 


Arg 


Gly 


Lys 


Asp 










165 










Asn 


Pro 


Arg 


Gly 


Glu 


Glu 


Gly 


Gly 


Pro 








180 










185 


Glu 


Val 


Arg 


Tyr 


Glu 


Val 


Cys 


Asp 


He 






195 










200 




Cys 


Met 


Thr 


Cys 


Asn 


Gly 


Glu 


Ser 


Tyr 




210 










215 






Glu 


Ser 


Gly Lys 


He 


Cys 


Gin 


Arg 


Trp 


225 










230 








His 


Lys 


Phe 


Leu 


Pro 


Glu 


Arg 


Tyr 


Pro 










245 










Tyr 


Cys 


Arg 


Asn 


Pro 


Asp 


Gly 


Gin 


Pro 








260 










265 


Asp 


Pro 


His 


Thr 


Arg 


Trp 


Glu 


Tyr 


Cys 






275 










280 




Asn 


Thr 


Met 


Asn 


Asp 


Thr 


Asp 


Val 


Pro 




290 










295 






Gin 


Gly 


Gin Gly 


Glu 


Gly Tyr 


Arg 


Gly 


3 05 










310 








Gly 


He 


Pro 


Cys 


Gin 


Arg 


Trp 


Asp 


Ser 










325 










Met 


Thr 


Pro 


Glu 


Asn 


Phe 


Lys 


Cys 


Lys 








340 










345 


Arg 


Asn 


Pro 


Asp 


Gly 


Ser 


Glu 


Ser 


Pro 






355 










360 




Asn 


He 


Arg 


val 


Gly 


Tyr 


Cys 


Ser 


Gin 




370 










375 






His 


Gly 


Gin 


Asp 


Cys 


Tyr Arg 


Gly Asn 


385 










390 








Leu 


Ser 


Gin 


Thr 


Arg 


Ser 


Gly 


Leu 


Thr 










405 










Met 


Glu 


Asp 


Leu 


His 


Arg 


His 


He 


Phe 








420 










425 


Leu 


Asn 


Glu 


Asn 


Tyr 


Cys 


Arg 


Asn 


Pro 






435 










440 




Trp 


Cys 


Tyr 


Thr 


Gly Asn 


Pro 


Leu 


He 




450 










455 






Ser 


Arg 


Cys 


Glu 


Gly Asp 


Thr 


Thr 


Pro 


465 










470 








Pro 


Val 


He 


Ser 


Cys 


Ala 


Lys 


Thr 


Lys 










485 










He 


Pro 


Thr 


Arg 


Thr 


Asn 


He 


Gly 


Trp 








500 










505 


Asn 


Lys 


His 


He 


Cys 


Gly Gly 


Ser 


Leu 






515 










520 




Thr 


Ala 


Arg 


Gin 


Cys . 


Phe 


Pro 


Ser 


Arg 




530 










535 






Trp 


Leu 


Gly 


He 


His 


Asp 


Val 


His 


Gly 


545 










550 








Gin 


Val 


Leu 


Asn 


val 


Ser 


Gin 


Leu 


Val 










565 
75 80



Ala 


Arg 


Lys 


Gin 


Cys 


Leu 


Trp 


90 










95 




Val 


Lys 


Lys 


Glu 


Phe 


Gly His 










110 






Tyr 


He 


Arg 


Asn 


Cys 


He 


He 








125 








Val 


Ser 


He 


Thr 


Lys 


Ser Gly 






140 










He 


Pro 


His 


Glu 


His 


Ser 


Phe 




155 










160 


Leu 


Gin 


Glu 


Asn 


Tyr 


Cys 


Arg 


170 










175 




Trp 


Cys 


Phe 


Thr 


Ser 


Asn 


Pro 










190 






Pro 


Gin 


Cys 


Ser 


Glu 


Val 


Glu 








205 








Arg 


Gly 


Leu 


Met 


Asp 


His 


Thr 






220 










Asp 


His 


Gin 


Thr 


Pro 


His 


Arg 




235 










240 


Asp 


Lys 


Gly 


Phe 


Asp 


Asp 


Asn 


250 










255 




Arg 


Pro 


Trp 


Cys 


Tyr 


Thr 


Leu 










270 






Ala 


He 


Lys 


Thr 


Cys 


Ala 


ASp 








285 








Leu 


Glu 


Thr 


Thr 


Glu 


Cys 


He 






300 










Thr 


Val 


Asn 


Thr 


lie 


Trp 


Asn 




315 










320 


Gin 


Tyr 


Pro 


His 


Glu 


His 


Asp 


330 










335 




Asp 


Leu 


Arg 


Glu 


Asn 


Tyr 


Cys 










350 






Trp 


Cys 


Phe 


Thr 


Thr 


Asp 


Pro 








365 








He 


Pro 


Asn 


Cys 


Asp 


Met 


Ser 






380 










Gly Lys 


Asn 


Tyr 


Met 


Gly Asn 




395 










400 


Cys 


Ser 


Met 


Trp 


Asp 


Lys 


Asn 


410 










415 




Trp 


Glu 


Pro 


Asp 


Ala 


Ser 


Lys 










430 






Asp 


Asp 


Asp 


Ala 


His 


Gly 


Pro 








445 








Pro 


Trp 


Asp 


Tyr 


Cys 


Pro 


He 






460 










Thr 


He 


Val 


Asn 


Leu 


Asp 


His 




475 










480 


Gin 


Leu 


Arg 


Val 


Val 


Asn 


Gly 


490 










495 




Met 


Val 


Ser 


Leu 


Arg 


Tyr 


Arg 










510 






He 


Lys 


Glu 


Ser 


Trp 


Val 


Leu 








525 








Asp 


Leu 


Lys 


Asp 


Tyr 


Glu 


Ala 






540 










Arg 


Gly 


Asp 


Glu 


Lys 


Cys 


Lys 




555 










560 


Tyr Gly 


Pro 


Glu 


Gly 


Ser 


Asp 


570 










575 
Leu Val Leu Met

580 

Ser Thr lie Asp 
595 

Ser Cys Ser Val 
610 

Gly Leu Leu Arg 
625 

Ser Gin His His 

Ala Gly Ala Glu 

660 

Gly Pro Leu Val 
675 

He Val Pro Gly 
690 

Val Arg Val Ala 
705 

Tyr Lys Val Pro 



Lys Leu Ala Arg 

Leu Pro Asn Tyr 

600 

Tyr Gly Trp Gly 
615 

Val Ala His Leu 
630 

Arg Gly Lys Val 
645 

Lys He Gly Ser 

Cys Glu Gin His 

680 

Arg Gly Cys Ala 
695 

Tyr Tyr Ala Lys 

710 
Gin Ser 
725 



Pro Ala Val Leu 
585 

Gly Cys Thr He 

Tyr Thr Gly Leu 

620 

Tyr He Met Gly 
635 

Thr Leu Asn Glu 
650 

Gly Pro Cys Glu 
665 

Lys Met Arg Met 

He Pro Asn Arg 

700 

Trp He His Lys 
715 



Asp Asp Phe Val 
590 

Pro Glu Lys Thr 
605 

He Asn Tyr Asp 

Asn Glu Lys Cys 

■ 640 

Ser Glu He Cys 
655 

Gly Asp Tyr Gly 
670 

Val Leu Gly Val 
685 

Pro Gly He Phe 

He He Leu Thr 

720 



<210> 121 

<211> 191 

<212> PRT 

<213> Homo sapiens



<400> 121 

Phe Pro Thr Ile Pro Leu Ser Arg Leu Phe Asp Asn Ala Met Leu Arg
1 5 10 15

Ala His Arg Leu His Gln Leu Ala Phe Asp Thr Tyr Gln Glu Phe Glu
20 25 30

Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro
35 40 45

Gln Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Arg
50 55 60

Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu Arg Ile Ser Leu
65 70 75 80

Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val
85 90 95

Phe Ala Asn Ser Leu Val Tyr Gly Ala Ser Asp Ser Asn Val Tyr Asp
100 105 110

Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu
115 120 125

Glu Asp Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser
130 135 140

Lys Phe Asp Thr Asn Ser His Asn Asp Asp Ala Leu Leu Lys Asn Tyr
145 150 155 160

Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe
165 170 175

Leu Arg Ile Val Gln Cys Arg Ser Val Glu Gly Ser Cys Gly Phe
180 185 190





<210> 122 

<211> 156 

<212> PRT 

<213> Homo sapiens 



<400> 122 

Ala Tyr Arg Pro Ser Glu Thr Leu 

1 5 

Leu Gin Phe Val Cys Gly Asp Arg 



Cys Gly Gly Glu Leu Val Asp Thr 

10 15 
Gly Phe Tyr Phe Ser Arg Pro Ala 
20



Ser 


Arg 


vai ser Arg 


Arg 


Ser 


Arg 






35 






40 


Arg 


Ser 


Cys Asp lieu 


Ala 


Leu 


Leu 




50 






55 




Lys 


Ser 


Glu Arg Asp 


Val 


Ser 


Thr 


65 






70 






Phe 


Pro 


Arg Tyr Pro 


Val 


Gly 


Lys 






85 








Gin 


Ser 


Thr Gin Arg 


Leu 


Arg 


Arg 






100 








Arg 


Arg 


Gly His Val 


Leu 


Ala 


Lys 






115 






120 


Lys 


Arg 


His Arg Pro 


Leu 


He 


Ala 




13 0 






135 




Gly 


Gly 


Ala Pro Pro 
395










400 


Cys 


Leu 


Glu 


Ser 


His 


Ser 


Ser 


Leu 


Ser 


Asp 


Ser 


Glu 


Phe 


Pro 


Pro 


Asn 










405 










410 










415 




Asn 


Lys 


Gly 


Glu 


He 


Lys 


Thr 


Glu 


Gly 


Gin 


Glu 


Leu 


He 


Thr 


Val 


He 








420 










425 










430 






Lys 


Ala 


Pro 


Thr 


Ser 


Phe 


Gly 


Tyr 


Asp 


Lys 


Pro 


His 


val 


Leu 


Val 


Asp 






435 










440 










445 








Leu 


Leu 


Val 


Asp 


Asp 


Ser 


Gly 


Lys 


Glu 


Ser 


Leu 


He 


Gly Tyr 


Arg 


Pro 




450 










455 










460 










Thr 


Glu 


Asp 


Ser 


Lys 


Glu 


Phe 


Ser 



















465 470 



<210> 137 

<211> 143 

<212> PRT 

<213> Homo sapiens 



<400> 137 



Gln Asp Pro Tyr Val Lys Glu Ala Glu Asn Leu Lys Lys Tyr Phe Asn
1 5 10 15

Ala Gly His Ser Asp Val Ala Asp Asn Gly Thr Leu Phe Leu Gly Ile
20 25 30

Leu Lys Asn Trp Lys Glu Glu Ser Asp Arg Lys Ile Met Gln Ser Gln
35 40 45

Ile Val Ser Phe Tyr Phe Lys Leu Phe Lys Asn Phe Lys Asp Asp Gln
50 55 60

Ser Ile Gln Lys Ser Val Glu Thr Ile Lys Glu Asp Met Asn Val Lys
65 70 75 80

Phe Phe Asn Ser Asn Lys Lys Lys Arg Asp Asp Phe Glu Lys Leu Thr
85 90 95

Asn Tyr Ser Val Thr Asp Leu Asn Val Gln Arg Lys Ala Ile His Glu
100 105 110

Leu Ile Gln Val Met Ala Glu Leu Ser Pro Ala Ala Lys Thr Gly Lys
115 120 125

Arg Lys Arg Ser Gln Met Leu Phe Arg Gly Arg Arg Ala Ser Gln
130 135 140



<210> 138 
<211> 143 
<212> PRT 
<213> Homo sapiens
<400> 138 



Met Glu Ser Pro Ser Ala Pro Pro His Arg Trp Cys Ile Pro Trp Gln
1 5 10 15

Arg Leu Leu Leu Thr Ala Ser Leu Leu Thr Phe Trp Asn Pro Pro Thr
20 25 30

Thr Ala Lys Leu Thr Ile Glu Ser Thr Pro Phe Asn Val Ala Glu Gly
35 40 45

Lys Glu Val Leu Leu Leu Val His Asn Leu Pro Gln His Leu Phe Gly
50 55 60

Tyr Ser Trp Tyr Lys Gly Glu Arg Val Asp Gly Asn Arg Gln Ile Ile
65 70 75 80

Gly Tyr Val Ile Gly Thr Gln Gln Ala Thr Pro Gly Pro Ala Tyr Ser
85 90 95

Gly Arg Glu Ile Ile Tyr Pro Asn Ala Ser Leu Leu Ile Gln Asn Ile
100 105 110

Ile Gln Asn Asp Thr Gly Phe Tyr Thr Leu His Val Ile Lys Ser Asp
115 120 125

Leu Val Asn Glu Glu Ala Thr Gly Gln Phe Arg Val Tyr Arg Glu
130 135 140





<210> 139 

<211> 440 

<212> PRT 

<213> Homo sapiens 



<400> 139 



Glu 


Met 


Val 


Asp 


1 








Glu 


Met 


Thr 


Val 








20 


Thr 


Gly 


Ser 


Ser 






35 




His 


Arg 


Tyr 


Tyr 




50 






Lys 


Gly 


Val 


Tyr 


65 








Gly 


Thr 


Asp 


Leu 


Ala 


Asn 


He 


Ala 








100 


Ser 


Asn 


Trp 


Glu 






115 




Pro 


Asp 


Asp 


Ser 




130 






His 


Val 


Pro 


Asn 


145 








Leu 


Asn 


Gin 


Ser 


Gly 


Gly 


He 


Asp 








180 


lie 


Arg 


Arg 


Glu 






195 




Asn 


Gly 


Gin Asp 




210 






Ser 


He 


Val 


Asp 


225 








Phe 


Glu 


Ala 


Ala 


Phe 


Pro 


Asp 


Gly 



260 



Asn Leu Arg Gly 
5 

Gly Ser Pro Pro 

Asn Phe Ala Val 

40 

Gln Arg Gln Leu
55 

Val Pro Tyr Thr 
70 

Val Ser Ile Pro
85

Ala Ile Thr Glu

Gly Ile Leu Gly

120 

Leu Glu Pro Phe 
135 

Leu Phe Ser Leu 
150 

Glu Val Leu Ala 
165 

His Ser Leu Tyr 

Trp Tyr Tyr Glu 

200 

Leu Lys Met Asp 
215 

Ser Gly Thr Thr 
230 

Val Lys Ser Ile
245 

Phe Trp Leu Gly 



Lys Ser Gly Gln
10

Gln Thr Leu Asn
25 

Gly Ala Ala Pro 

Ser Ser Thr Tyr 

60 

Gln Gly Lys Trp
75 

His Gly Pro Asn 
90 

Ser Asp Lys Phe 
105 

Leu Ala Tyr Ala 

Phe Asp Ser Leu 

140 

Gln Leu Cys Gly
155 

Ser Val Gly Gly 
170 

Thr Gly Ser Leu 
185 

Val Ile Ile Val

Cys Lys Glu Tyr 

220 

Asn Leu Arg Leu 
235 

Lys Ala Ala Ser 
250 

Glu Gln Leu Val
265 



Gly Tyr Tyr Val 
15 

Ile Leu Val Asp
30 

His Pro Phe Leu 
45 

Arg Asp Leu Arg

Glu Gly Glu Leu 

80 

Val Thr Val Arg 
95 

Phe Ile Asn Gly
110

Glu Ile Ala Arg
125

Val Lys Gln Thr

Ala Gly Phe Pro 

160 

Ser Met Ile Ile
175 

Trp Tyr Thr Pro 
190 

Arg Val Glu Ile
205 

Asn Tyr Asp Lys 

Pro Lys Lys Val 

240 

Ser Thr Glu Lys 
255 

Cys Trp Gln Ala
270
Gly


Thr 


Thr 


Pro 


Trp Asn 


He 


Phe 


Pro 


Val 






275 










280 






Gly Glu 


Val 


Thr 


Asn 


Gin 


Ser 


Phe 


Arg 


He 




290










295 








Tyr


Leu


Arg 


Pro 


Val 


Glu Asp Val 


Ala 


Thr 


305










310 










Lys 


Phe 


Ala 


He 


Ser 


Gin 


Ser 


Ser 


Thr Gly 










325 










330 


lie 


Met 


Glu Gly Phe Tyr Val Val 


Phe 


Asp 








340 










345 




Gly 


Phe 


Ala 


Val 


Ser 


Ala 


Cys 


His 


Val 


His 






355 










360 






Ala 


Val 


Glu Gly 


Pro 


Phe 


Val 


Thr 


Leu 


Asp 




370 










375 








Asn 


He 


Pro 


Gin 


Thr 


Asp 


Glu 


Ser 


Thr 


Leu 


385 










390 










Met 


Ala 


Ala 


He 


Cys 


Ala 


Leu 


Phe 


Met 


Leu 










405 










410 


Cys 


Gin 


Trp Arg 


Cys 


Leu 


Arg 


Cys 


Leu 


Arg 








420 










425 




Ala 


Asp 


Asp 


He 


Ser 


Leu 


Leu 


Lys 










435 










440 






<210> 140 
















<211> 810 
















<212> PRT 
















<213> Homo sapiens












<400> 140 
















Met 


Glu 


His 


Lys 


Glu 


Val 


Val 


Leu 


Leu 


Leu 


1 








5 










10 


Gly Gin 


Gly Glu 


Pro 


Leu 


Asp 


Asp 


Tyr 


Val 








20 










25 




Leu 


Phe 


Ser 


Val 


Thr 


Lys 


Lys 


Gin 


Leu 


Gly 






35 










40 






Cys 


Ala 


Ala 


Lys 


Cys 


Glu 


Glu 


Asp 


Glu 


Glu 




50 










55 








Gin 


Tyr 


His 


Ser 


Lys 


Glu 


Gin 


Gin 


Cys 


Val 


65 










70 










Lys 


Ser 


Ser 


lie 


He 


He 


Arg 


Met 


Arg 


Asp 










85 










90 


Lys 


Val 


Tyr 


Leu 


Ser 


Glu 


Cys 


Lys 


Thr 


Gly 








100 










105 




Gly 


Thr 


Met 


Ser 


Lys 


Thr 


Lys 


Asn 


Gly 


He 






115 










120 






Ser 


Thr 


Ser 


Pro 


His 


Arg 


Pro 


Arg 


Phe 


Ser 




130 










135 








Glu 


Gly 


Leu 


Glu 


Glu 


Asn 


Tyr 


Cys 


Arg 


Asn 


145 










150 










Gly Pro 


Trp 


Cys 


Tyr 


Thr 


Thr 


Asp 


Pro 


Glu 










165 










170 


Asp 


He 


Leu 


Glu 


Cys 


Glu 


Glu 


Glu 


Cys 


Met 








180 










185 




Tyr Asp 


Gly Lys 


lie 


Ser 


Lys 


Thr 


Met 


Ser 






195 










200 






Trp 


Asp 


Ser 


Gin 


Ser 


Pro 


His 


Ala 


His 


Gly 




210 










215








Pro 


Asn 


Lys 


Asn 


Leu 


Lys 


Lys 


Asn 


Tyr 


Cys 


225 










230 










Leu 


Arg 


Pro 


Trp 


Cys 


Phe 


Thr 


Thr 


Asp 


Pro 










245 










250 


Cys 


Asp 


He 


Pro 


Arg 


Cys 


Thr 


Thr 


Pro 


Pro 








260 










265 
Ile Ser Leu Tyr Leu Met
285

Thr Ile Leu Pro Gln Gln
300

Ser Gln Asp Asp Cys Tyr
315 320 
Thr Val Met Gly Ala Val 

335 

Arg Ala Arg Lys Arg Ile

350 

Asp Glu Phe Arg Thr Ala 
365 

Met Glu Asp Cys Gly Tyr 
380 

Met Thr Ile Ala Tyr Val
395 400 
Pro Leu Cys Leu Met Val 

415 

Gln Gln His Asp Asp Phe

430 



Leu Leu Phe Leu Lys Ser 

15 

Asn Thr Gln Gly Ala Ser

30 

Ala Gly Ser Ile Glu Glu
45 

Phe Thr Cys Arg Ala Phe 
60 

Ile Met Ala Glu Asn Arg
75 80 
Val Val Leu Phe Glu Lys 

95 

Asn Gly Lys Asn Tyr Arg 

110 

Thr Cys Gln Lys Trp Ser
125 

Pro Ala Thr His Pro Ser 
140 

Pro Asp Asn Asp Pro Gln
155 160 
Lys Arg Tyr Asp Tyr Cys 

175 

His Cys Ser Gly Glu Asn 

190 

Gly Leu Glu Cys Gln Ala
205

Tyr Ile Pro Ser Lys Phe
220 

Arg Asn Pro Asp Arg Glu 
235 240 
Asn Lys Arg Trp Glu Leu 

255 

Pro Ser Ser Gly Pro Thr 

270 
Tyr Gln Cys Leu
275 

Val Thr Val Ser 
290 

His Thr His Asn 
305 

Glu Asn Tyr Cys 

Thr Thr Asn Ser 

340 

Asp Ser Ser Pro 
355 

Glu Leu Thr Pro 
370 

Tyr Arg Gly Thr 
385 

Trp Ser Ser Met 

Pro Asn Ala Gly 

420 

Lys Gly Pro Trp 
435 

Cys Asn Leu Lys
450 

Pro Pro Val Val 
465 

Cys Met Phe Gly 

Val Thr Gly Thr 

500 

His Ser Ile Phe
515

Asn Tyr Cys Arg 
530 

Thr Thr Asn Pro 
545 

Ala Ala Pro Ser 

Cys Pro Gly Arg 

580 

Pro Trp Gln Val
595

Gly Thr Leu Ile
610 

Glu Lys Ser Pro 
625 

Gln Glu Val Asn

Leu Phe Leu Glu 

660 

Ser Pro Ala Val 
675 

Pro Asn Tyr Val 
690 

Gly Glu Thr Gln
705

Leu Pro Val Ile

Gly Arg Val Gln

740 

Thr Asp Ser Cys 
755 

Lys Asp Lys Tyr 



Lys Gly Thr Gly 

280 

Gly His Thr Cys 
295 

Arg Thr Pro Glu 
310 

Arg Asn Pro Asp 
325 

Gln Val Arg Trp

Val Ser Thr Glu 

360 

Val Val Gln Asp
375 

Ser Ser Thr Thr 
390 

Thr Pro His Arg 
405

Leu Thr Met Asn 

Cys Phe Thr Thr 

440 

Lys Cys Ser Gly 
455 

Leu Leu Pro Asp 
470 

Asn Gly Lys Gly 

485

Pro Cys Gln Asp

Thr Pro Glu Thr 

520 

Asn Pro Asp Gly 
535

Arg Lys Leu Tyr 
550 

Phe Asp Cys Gly 

565

Val Val Gly Gly 

Ser Leu Arg Thr 

600 

Ser Pro Glu Trp 
615 

Arg Pro Ser Ser 
630 

Leu Glu Pro His 
645 

Pro Thr Arg Lys 

Ile Thr Asp Lys

680 

Val Ala Asp Arg 
695 

Gly Thr Phe Gly 
710 

Glu Asn Lys Val 
725 

Ser Thr Glu Leu 

Gln Gly Asp Ser

760

Ile Leu Gln Gly



Glu Asn Tyr Arg 

Gln His Trp Ser

300 

Asn Phe Pro Cys 
315 

Gly Lys Arg Ala 
330 

Glu Tyr Cys Lys 
345 

Gln Leu Ala Pro

Cys Tyr His Gly 

380 

Thr Thr Gly Lys 
395 

His Gln Lys Thr
410 

Tyr Cys Arg Asn 
425 

Asp Pro Ser Val 

Thr Glu Ala Ser 

460 

Val Glu Thr Pro 
475

Tyr Arg Gly Lys 
490 

Trp Ala Ala Gln
505 

Asn Pro Arg Ala 

Asp Val Gly Gly 

540 

Asp Tyr Cys Asp 
555 

Lys Pro Gln Val
570 

Cys Val Ala His 
585 

Arg Phe Gly Met 

Val Leu Thr Ala 

620 

Tyr Lys Val Ile
635

Val Gln Glu Ile
650

Asp Ile Ala Leu
665

Val Ile Pro Ala

Thr Glu Cys Phe 

700 

Ala Gly Leu Leu 
715 

Cys Asn Arg Tyr 
730 

Cys Ala Gly His 
745 

Gly Gly Pro Leu 
Val Thr Ser Trp 



Gly Asn Val Ala 
285 

Ala Gln Thr Pro

Lys Asn Leu Asp 

320 

Pro Trp Cys His 
335 

Ile Pro Ser Cys
350 

Thr Ala Pro Pro 
365 

Asp Gly Gln Ser

Lys Cys Gln Ser

400 

Pro Glu Asn Tyr 
415 

Pro Asp Ala Asp 
430

Arg Trp Glu Tyr
445

Val Val Ala Pro 

Ser Glu Glu Asp 

480 

Arg Ala Thr Thr 
495 

Glu Pro His Arg 
510 

Gly Leu Glu Lys 

525 

Pro Trp Cys Tyr 

Val Pro Gln Cys

560

Glu Pro Lys Lys
575 

Pro His Ser Trp 
590 

His Phe Cys Gly 
605 

Ala His Cys Leu 

Leu Gly Ala His 

640 

Glu Val Ser Arg 
655 

Leu Lys Leu Ser 
670 

Cys Leu Pro Ser 
685 

Ile Thr Gly Trp

Lys Glu Ala Gln

720 

Glu Phe Leu Asn 
735 

Leu Ala Gly Gly 
750 

Val Cys Phe Glu 
765 

Gly Leu Gly Cys 
770 775 780

Ala Arg Pro Asn Lys Pro Gly Val Tyr Val Arg Val Ser Arg Phe Val
785 790 795 800

Thr Trp Ile Glu Gly Val Met Arg Asn Asn

805 810 



<210> 141 

<211> 762 

<212> PRT 

<213> Homo sapiens 



<400> 141 

Gly Pro Asn Ile

1 

Leu Ala Val Ser 

20 

Leu Gly Ser Pro 
35 

Cys Ala Pro Glu 
50 

Glu Asp Arg Pro 
65 

Thr Gln Val Ser

Ser Lys Asn Phe 

100 

Asp Ile Tyr Tyr
115

Trp Ser Ile Gln
130 

Leu Thr Ser Asn 
145 

Val Ser Pro Tyr 

Cys Tyr Asp Met 

180 

Val Leu Thr Leu 
195 

Lys Gln Ser Val
210

Ala Ile Met Gln
225 

Asp Ala Ser His 

Ala Leu Asp Gly 

260 

Cys His Val Gly 
275 

Tyr Pro Ser Leu 
290 

Asn Leu Ile Phe
305 

Tyr Ser Glu Leu 

Ser Ser Asn Val 

340 

Ser Lys Val Glu 
355 

Ser Phe Asn Ala 
370 

Ser Cys Met Gly 
385 



Cys Thr Thr Arg 

5 

Pro Met Cys Ala 

Arg Cys Asp Leu 

40 

Ser Ile Glu Phe
55

Leu Ser Asp Lys
70

Pro Gln Arg Ile
85

Ser Ile Gln Val

Leu Met Asp Leu 

120 

Asn Leu Gly Thr 
135 

Leu Arg Ile Gly
150

Met Tyr Ile Ser
165 

Lys Thr Thr Cys 

Thr Asp Gln Val

200 

Ser Arg Asn Arg 
215 

Ala Thr Val Cys 
230 

Leu Leu Val Phe 
245 

Arg Leu Ala Gly 

Ser Asp Asn His 

280 

Gly Leu Met Thr 
295 

Ala Val Thr Glu 
310 

Ile Pro Gly Thr
325

Leu Gln Leu Ile

Leu Glu Val Arg 

360 

Thr Cys Leu Asn 
375 

Leu Lys Ile Gly
390 



Gly Val Ser Ser 
10 

Trp Cys Ser Asp 
25

Lys Glu Asn Leu 

Pro Val Ser Glu 

60 

Gly Ser Gly Asp 
75 

Ala Leu Arg Leu 
90 

Arg Gln Val Glu
105 

Ser Tyr Ser Met 

Lys Leu Ala Thr 

140 

Phe Gly Ala Phe 
155 

Pro Pro Glu Ala 
170 

Leu Pro Met Phe 
185 

Thr Arg Phe Asn 


Asp Ala Pro Glu 

220 

Asp Glu Lys Ile
235 

Thr Thr Asp Ala 
250 

Ile Val Gln Pro
265 

Tyr Ser Ala Ser 

Glu Lys Leu Ser 

300 

Asn Val Val Asn 
315 

Thr Val Gly Val 
330 

Val Asp Ala Tyr 
345 

Asp Leu Pro Glu 

Asn Glu Val Ile

380 

Asp Thr Val Ser 
395 



Cys Gln Gln Cys
15 

Glu Ala Leu Pro 
30 

Leu Lys Asp Asn 
45 

Ala Arg Val Leu 

Ser Ser Gln Val

80 

Arg Pro Asp Asp 
95 

Asp Tyr Pro Val 
110 

Lys Asp Asp Leu 
125 

Gln Met Arg Lys

Val Asp Lys Pro 

160 

Leu Glu Asn Pro 
175 

Gly Tyr Lys His 
190 

Glu Glu Val Lys 
205 

Gly Gly Phe Asp 

Gly Trp Arg Asn 

240 

Lys Thr His Ile
255

Asn Asp Gly Gln
270 

Thr Thr Met Asp 
285 

Gln Lys Asn Ile

Leu Tyr Gln Asn

320 

Leu Ser Met Asp 
335 

Gly Lys Ile Arg
350 

Glu Leu Ser Leu 
365 

Pro Gly Leu Lys 

Phe Ser Ile Glu

400 






Ala Lys Val Arg 

Lys Pro Val Gly 

420 

Cys Asp Cys Ala 
435 

Asn Asn Gly Asn 
450 

Gly Trp Leu Gly
465

Ser Gln Gln Asp

Gln Arg Gly Glu

500 

Phe Gly Lys Ile
515 

Val Arg Tyr Lys 
530 

Gly Asp Cys Leu 
545 

Thr Thr Arg Thr 

Gly Arg Gly Lys 

580 

Ser Tyr Gly Asp 
595 

Thr Phe Lys Lys 
610 

Tyr Met Thr Glu 
625 

Ser Val Lys Glu 

Tyr Lys Asn Glu 

660 

Ser Ser Gly Lys 
675 

Lys Gly Pro Asp 
690 

Leu Leu Ile Gly
705

Ile His Asp Arg

Ala Lys Trp Asp 

740 

Thr Phe Thr Asn 

755 



Gly 


Cys 


Pro 


Gin 


405








Phe 


Lys 


Asp 


Ser 


Cys 


Gin 


Ala 


Gin 








440 


Gly 


Thr 


Phe 


Glu 






455 




Ser 


Gin 


Cys 


Glu 




470 






Glu 


Cys 


Ser 


Pro 


485 








Cys 


Leu 


Cys 


Gly 


Thr 


Gly Lys 


Tyr 








520 


Gly 


Glu 


Met 


Cys 






535 




Cys 


Asp 


Ser 


Asp 




550 






Asp 


Thr 


Cys 


Met 


565 








Cys 


Glu 


Cys 


Gly 


Thr 


Cys 


Glu 


Lys 








600 


Glu 


Cys 


Val 


Glu 






615 




Asn 


Thr 


Cys 


Asn 




630 






Leu 


Lys 


Asp 


Thr 


645 








Asp 


Asp 


Cys 


Val 


Ser 


lie 


Leu 


Tyr 








680 


lie 


Leu 


Val 


Val 






695 




Leu 


Ala 


Ala 


Leu 




710 






Lys 


Glu 


Phe 


Ala 


725 








Thr 


Ala 


Asn 


Asn 


lie 


Thr 


Tyr 


Arg 








760 



Glu Lys Glu Lys 
410 

Leu Ile Val Gln
425 

Ala Glu Pro Asn 

Cys Gly Val Cys 

460 

Cys Ser Glu Glu 
475 

Arg Glu Gly Gln
490

Gln Cys Val Cys
505 

Cys Glu Cys Asp 

Ser Gly His Gly 

540 

Trp Thr Gly Tyr 
555 

Ser Ser Asn Gly 
570 

Ser Cys Val Cys 
585 

Cys Pro Thr Cys 

Cys Lys Lys Phe 

620 

Arg Tyr Cys Arg 
635 

Gly Lys Asp Ala 
650 

Val Arg Phe Gln
665 

Val Val Glu Glu 

Leu Leu Ser Val 

700 

Leu Ile Trp Lys
715 

Lys Phe Glu Glu 
730 

Pro Leu Tyr Lys 
745 

Gly Thr 



Ser 


Phe 


Thr 


He 






415 




Val 


Thr 


Phe 


Asp 




430 






Ser 


His 


Arg Cys


445








Arg 


Cys 


Gly 


Pro 


Asp 


Tyr 


Arg


Pro 








480 


Pro 


Val 


Cys 


Ser 






495 




His 


Ser 


Ser 


Asp 




510 






Asp 


Phe 


Ser 


Cys 


525 








Gin 


Cys 


Ser 


Cys 


Tyr 


Cys 


noil 










560 


Leu 


Leu 


Cys 


Ser 






575 




lie 


Gin 


Pro 


Gly 




590 






Pro 


Asp 


Ala 


Cys 


605 








Asp 


Arg 


Glu 


Pro 


Asp 


Glu 


He 


Glu 








640 


Val 


Asn 


Cys 


Thr 






655 




Tyr 


Tyr 


Glu 


Asp 




670 






Pro 


Glu 


Cys 


Pro 


685 








Met 


Gly 


Ala 


He 


Leu 


Leu 


He 


Thr 








720 


Glu 


Arg 


Ala 


Arg 






735 




Glu 


Ala 


Thr 


Ser 



<210> 142 

<211> 505 

<212> PRT 

<213> Homo sapiens 

<400> 142 

Gln Thr Ser Val Ser Pro Ser Lys
1 5 
Val Leu Val Thr Cys Ser Thr Ser 

20 

Ile Glu Thr Pro Leu Pro Lys Lys

35 40 
Arg Lys Val Tyr Glu Leu Ser Asn 

50 55 
Cys Tyr Ser Asn Cys Pro Asp Gly 



Val Ile Leu Pro Arg Gly Gly Ser

10 15
Cys Asp Gln Pro Lys Leu Leu Gly
25 30 
Glu Leu Leu Leu Pro Gly Asn Asn 

45 

Val Gln Glu Asp Ser Gln Pro Met

60

Gln Ser Thr Ala Lys Thr Phe Leu
65










70 










75 










80 


Thr 


Val 


Tyr Trp 


Thr 


Pro 


Glu 


Arg 


Val 


Glu 


Leu 


Ala 


Pro 


Leu 


Pro 


Ser 










85 










90 










95 




Trp 


Gin 


Pro 


Val 


Gly Lys 


Asn 


Leu 


Thr 


Leu Arg 


Cys 


Gin 


Val 


Glu 


Gly 








100 










105 










110 




Gly Ala 


Pro 


Arg 


Ala 


Asn 


Leu 


Thr 


Val 


Val 


Leu 


Leu 


Arg 


Gly 


Glu 


Lys 






115 










120 










125 






Glu 


Leu 


Lys 


Arg 


Glu 


Pro 


Ala 


Val 


Gly Glu 


Pro 


Ala 


Glu 


Val 


Thr 


Thr 




130










135 










140 










Thr 


Val 


Leu 


Val 


Arg 


Arg 


Asp 


His 


His 


Gly Ala Asn 


Phe 


Ser 


Cys 


Arg 


145 










150 










155 










160 


Thr 


Glu 


Leu 


Asp 


Leu 


Arg 


Pro Gln Gly Leu Glu Leu Phe Glu


Asn 


Thr 










165 










170 










175 




Ser 


Ala 


Pro 


Tyr 
180


Gin 


Leu 


Gin 


Thr 


Phe 
185 


Val 


Leu 


Pro 


Ala 


Thr 
190 


Pro 


Pro 


Gin 


Leu 


Val 


Ser 


Pro 


Arg 


Val 


Leu 


Glu 


Val 


Asp 


Thr 


Gln Gly


Thr 


Val 






195 










200 










205 








Val 


Cys 
210 


Ser 


Leu 


Asp 


Gly 


Leu 
215 


Phe 


Pro 


Val 


Ser 


Glu 
220 


Ala 


Gin 


Val 


His 


Leu 


Ala 


Leu 


Gly 


Asp 


Gin 


Arg 


Leu 


Asn 


Pro 


Thr 


Val 


Thr 


Tyr 


Gly Asn 


225 










230 










235 










240


Asp 


Ser 


Phe 


Ser 


Ala 
245 


Lys 


Ala 


Ser 


Val 


Ser 
250 


Val 


Thr 


Ala 


Glu 


Asp 
255 


Glu 


Gly Thr 


Gin 


Arg 


Leu 


Thr 


Cys 


Ala 


Val 


He 


Leu 


Gly Asn 


Gin 


Ser 


Gin 








260 










265 










270 






Glu 


Thr 


Leu 
275 


Gin 


Thr 


Val 


Thr 


lie 
280 


Tyr 


Ser 


Phe 


Pro 


Ala 
285 


Pro 


Asn 


Val 


lie 


Leu 
290 


Thr 


Lys 


Pro 


Glu 


Val 
295 


Ser 


Glu 


Gly 


Thr 


Glu 
300 


Val 


Thr 


Val 


Lys 


Cys 


Glu 


Ala 


His 


Pro 


Arg 


Ala 


Lys 


Val 


Thr 


Leu 


Asn Gly Val 


Pro 


Ala 


305 










310 










315 










320 


Gin 


Pro 


Leu 


Gly 


Pro 
325 


Arg 


Ala 


Gin 


Leu 


Leu 
330 


Leu 


Lys 


Ala 


Thr 


Pro 
335 


Glu 


Asp Asn 


Gly Arg 


Ser 


Phe 


Ser 


Cys 


Ser 


Ala 


Thr 


Leu 


Glu 


Val 


Ala 


Gly 








340 










345 










350 






Gin 


Leu 


lie 
355 


His 


Lys 


Asn 


Gin 


Thr 
360 


Arg 


Glu 


Leu 


Arg 


Val 
365 


Leu 


Tyr 


Gly 


Pro 


Arg 


Leu 


Asp 


Glu Arg 


Asp 


Cys 


Pro 


Gly Asn 


Trp 


Thr 


Trp 


Pro 


Glu 




370 










375 










380 










Asn 


Ser 


Gin 


Gin 


Thr 


Pro 


Met 


Cys 


Gin 


Ala 


Trp 


Gly Asn 


Pro 


Leu 


Pro 


385 










390 










395 










400 


Glu 


Leu 


Lys 


Cys 


Leu 
405 


Lys 


Asp 


Gly 


Thr 


Phe 
410 


Pro 


Leu 


Pro 


He 


Gly 
415 


Glu 


Ser 


Val


Thr 


Val 


Thr 


Arg 


Asp 


Leu 


Glu 


Gly Thr 


Tyr 


Leu 


Cys 


Arg 


Ala 








420 










425 










430 






Arg 


Ser 


Thr 


Gin 


Gly Glu 


Val 


Thr 


Arg 


Glu 


Val 


Thr 


Val 


Asn 


Val 


Leu 






435 










440 










445 








Ser 


Pro 
450 


Arg 


Tyr 


Glu 


lie 


Val 
455 


lie 


lie 


Thr 


Val 


Val 
460 


Ala 


Ala 


Ala 


Val 


lie 


Met 


Gly 


Thr 


Ala 


Gly 


Leu 


Ser 


Thr 


Tyr 


Leu 


Tyr 


Asn 


Arg 


Gin 


Arg 


465 










470 










475 










480 


Lys 


lie 


Lys 


Lys 


Tyr 
485 


Arg 


Leu 


Gin 


Gin 


Ala 
490 


Gin 


Lys 


Gly 


Thr 


Pro 
495 


Met 


Lys 


Pro 


Asn 


Thr 
500 


Gin 


Ala 


Thr 


Pro 


Pro 
505 

















<210> 143 

<211> 261 

<212> PRT 

<213> Homo sapiens 

<400> 143 
Met Ile Glu Thr
1

Leu Pro Ile Ser

20

Ile Thr Gln Met
35 

Arg Leu Asp Lys 
50 

Phe Met Lys Thr 
65 

Leu Leu Asn Cys 

Asp Ile Met Leu

100

Met Gln Lys Gly
115 

Glu Ala Ser Ser 
130 

Tyr Tyr Thr Met 
145 

Leu Thr Val Lys 

Phe Cys Ser Asn 

180 

Leu Cys Leu Lys 
195 

Ala Asn Thr His 
210 

Leu Gly Gly Val 
225 

Val Thr Asp Pro 

Gly Leu Leu Lys 

260 



Tyr Asn Gln Thr
5

Met Lys Ile Phe

Ile Gly Ser Ala

40

Ile Glu Asp Glu
55

Ile Gln Arg Cys
70

Glu Glu Ile Lys
85 

Asn Lys Glu Glu 

Asp Gln Asn Pro

120 

Lys Thr Thr Ser 
135 

Ser Asn Asn Leu 
150 

Arg Gln Gly Leu
165 

Arg Glu Ala Ser 

Ser Pro Gly Arg 

200 

Ser Ser Ala Lys 
215 

Phe Glu Leu Gln
230

Ser Gln Val Ser

245 

Leu 



Ser Pro Arg Ser 
10 

Met Tyr Leu Leu 
25 

Leu Phe Ala Val 

Arg Asn Leu His 

60 

Asn Thr Gly Glu 
75 

Ser Gln Phe Glu
90 

Thr Lys Lys Glu 
105 

Gln Ile Ala Ala

Val Leu Gln Trp

140

Val Thr Leu Glu 
155 

Tyr Tyr Ile Tyr
170

Ser Gln Ala Pro
185

Phe Glu Arg Ile

Pro Cys Gly Gln

220 

Pro Gly Ala Ser 
235 

His Gly Thr Gly 
250 



Ala Ala Thr Gly 
15 

Thr Val Phe Leu 
30 

Tyr Leu His Arg 
45 

Glu Asp Phe Val 

Arg Ser Leu Ser 

80 

Gly Phe Val Lys 
95 

Asn Ser Phe Glu 
110 

His Val Ile Ser
125 

Ala Glu Lys Gly 

Asn Gly Lys Gln

160

Ala Gln Val Thr
175

Phe Ile Ala Ser
190 

Leu Leu Arg Ala 
205 

Gln Ser Ile His

Val Phe Val Asn 

240 

Phe Thr Ser Phe 
255 



<210> 144 

<211> 187 

<212> PRT 

<213> Homo sapiens 



<400> 144 



Ala 


Met 


His 


Val 


Ala 


Gin 


Pro 


Ala 


Val 


Val 


Leu 


Ala 


Ser 


Ser 


Arg 


Gly 


1 








5 










10 










15 


He 


Ala 


Ser 


Phe 


Val 


Cys 


Glu 


Tyr 


Ala 


Ser 


Pro 


Gly 


Lys 


Ala 


Thr 


Glu 








20 










25 










30 






Val 


Arg 


Val 


Thr 


Val 


Leu 


Arg 


Gin 


Ala 


Asp 


Ser 


Gin 


Val 


Thr 


Glu 


Val 






35 










40 










45 








Cys 


Ala 


Ala 


Thr 


Tyr 


Met 


Met 


Gly 


Asn 


Glu 


Leu 


Thr 


Phe 


Leu 


Asp 


Asp 




50 










55 










60 










Ser 


He 


Cys 


Thr 


Gly 


Thr 


Ser 


Ser 


Gly 


Asn 


Gin 


Val 


Asn 


Leu 


Thr 


He 


65 










70 










75 










80 


Gin 


Gly 


Leu 


Arg 


Ala 


Met 


Asp 


Thr 


Gly Leu Tyr 


He 


Cys 


Lys 


Val 


Glu 










85 










90 










95 




Leu 


Met 


Tyr 


Pro 


Pro 


Pro 


Tyr 


Tyr 


Leu 


Gly 


He 


Gly Asn 


Gly Thr Gin 








100 










105 










110 






He 


Tyr 


Val 


He 


Asp 


Pro 


Glu 


Pro 


Cys 


Pro 


Asp 


Ser 


Asp 


Phe 


Leu 


Leu 






115 










120










125 








Trp 


He 


Leu 


Ala 


Ala 


Val 


Ser 


Ser 


Gly 


Leu 


Phe 


Phe 


Tyr 


Ser 


Phe 


Leu 




130 










135 










140 










Leu 


Thr 


Ala 


Val 


Ser 


Leu 


Ser 


Lys 


Met 


Leu 


Lys 


Lys 


Arg 


Ser 


Pro 


Leu 


145 










150 










155 










160 


Thr 


Thr 


Gly Val 


Tyr 


Val 


Lys 


Met 


Pro 


Pro 


Thr 


Glu 


Pro 


Glu 


Cys 


Glu 
165 170
Lys Gln Phe Gln Pro Tyr Phe Ile Pro Ile Asn

180 185 



175 



<210> 145 

<211> 544

<212> PRT 

<213> Homo sapiens 



<400> 145 



He 


Pro 


Pro 


His 


Val 


Gin 


Lys 


Ser 


1








5 








Asp 


Asn 


Asn 


Gly 


Ala 


Val 


Lys 


Phe 








20 










Val 


Arg 


Phe 


Ser 


Thr 


Cys 


Asp Asn 






35 










40 


Ser 


He 


Thr 


Ser 


He 


Cys 


Glu 


Lys 




50 










55 




Trp 


Arg 


Lys 


Asn 


Asp 


Glu 


Asn 


He 


65 










70 






Pro 


Lys 


Leu 


Pro 


Tyr 


His 


Asp 


Phe 










85 








Lys 


Cys 


He 


Met 


Lys 


Glu 


Lys


Lys 








100 










Cys 


Ser 


Cys 


Ser 


Ser 


Asp 


Glu 


Cys 






115 










120 


Glu 


Tyr 


Asn 


Thr 


Ser 


Asn 


Pro 


Asp 




130 










135 




Thr 


Gly 


He 


Ser 


Leu 


Leu 


Pro 


Pro 


145 










150 






He 


lie 


Phe 


Tyr 


Cys 


Tyr 


Arg 


Val 










165 








Thr 


Trp 


Glu 


Thr 


Gly 


Lys 


Thr 


Arg 








180 










Cys 


Ala 


He 


He 


Leu 


Glu 


Asp 


Asp 






195 










200 


Ala 


Asn 


Asn 


He 


Asn 


His 


Asn 


Thr 




210 










215 




Thr 


Leu 


Val 


Gly 


Lys 


Gly 


Arg 


Phe 


225 










230 






Lys 


Gin 


Asn 


Thr 


Ser 


Glu 


Gin 


Phe 










245 








Pro 


Tyr 


Glu 


Glu 


Tyr 


Ala 


Ser 


Trp 








260 










Asp 


lie 


Asn 


Leu 


Lys 


His 


Glu 


Asn 






275 










280 


Glu 


Arg 


Lys 


Thr 


Glu 


Leu 


Gly 


Lys 




290 










295 




His 


Ala 


Lys 


Gly 


Asn 


Leu 


Gin 


Glu 


305 










310 






Trp 


Glu 


Asp 


Leu 


Arg 


Lys


Leu 


Gly 










325 








His 


Leu 


His 


Ser 


Asp 


His 


Thr 


Pro 








340 










Val 


His 


Arg 


Asp 


Leu 


Lys 


Ser 


Ser 






355 










360 


Thr 


Cys 


Cys 


Leu 


Cys 


Asp 


Phe 


Gly 




370 










375 




Leu 


Ser 


Val 


Asp 


Asp 


Leu 


Ala 


Asn 


385 










390 






Tyr 


Met 


Ala 


Pro 


Glu 


Val 


Leu 


Glu 



Val 


Asn 


Asn 


Asp 


Met 


He 


Val 


Thr 




10 










15 




Pro 


Gin 


Leu 


Cys 


Lys 


Phe 


Cys 


Asp 


25 










30 






Gin 


Lys 


Ser 


Cys 


Met 


Ser 


Asn 


Cys 










45 








Pro 


Gin 


Glu 


Val 


Cys 


Val 


Ala 


Val








60 










Thr 


Leu 


Glu 


Thr 


Val 


Cys 


His 


Asp 






75 










80 


He 


Leu 


Glu 


Asp 


Ala 


Ala 


Ser 


Pro 




90 










95 




Lys 


Pro 


Gly Glu 


Thr 


Phe 


Phe 


Met 


105 










110 






Asn 


Asp 


Asn 


He 


He 


Phe 


Ser 


Glu 










125 








Leu 


Leu 


Leu 


Val 


He 


Phe 


Gin 


Val 








140 










Leu 


Gly Val 


Ala 


He 


Ser 


Val 


He 






155 










160 


Asn 


Arg 


Gin 


Gin 


Lys 


Leu 


Ser 


Ser 




170 










175 




Lys 


Leu 


Met 


Glu 


Phe 


Ser 


Glu 


His 


185 










190 






Arg 


Ser 


Asp 


He 


Ser 


Ser 


Thr 


Cys 










205 








Glu 


Leu 


Leu 


Pro 


He 


Glu 


Leu 


Asp 








220 










Ala 


Glu 


Val 


Tyr 


Lys 


Ala 


Lys 


Leu 






235 










240 


Glu 


Thr 


Val 


Ala 


Val 


Lys 


He 


Phe 




250 










255 




Lys 


Thr 


Glu 


Lys 


Asp 


He 


Phe 


Ser 


265 










270 






He 


Leu 


Gin 


Phe 


Leu 


Thr 


Ala 


Glu 










285 








Gin 


Tyr 


Trp 


Leu 


He 


Thr 


Ala 


Phe 








300 










Tyr 


Leu 


Thr 


Arg


His 


Val


He 


Ser 






315 










320 


Ser 


Ser 


Leu 


Ala 


Arg 


Gly 


He 


Ala 




330 










335 




Cys 


Gly Arg 


Pro 


Lys 


Met 


Pro 


He 


345 










350 






Asn 


He 


Leu 


Val 


Lys 


Asn 


Asp 


Leu 










365 








Leu 


Ser 


Leu 


Arg 


Leu 


Asp 


Pro 


Thr 








380 










Ser Gly Gln Val


Gly 


Thr 


Ala 


Arg 






395 










400 


Ser 


Arg 


Met 


Asn 


Leu 


Glu 


Asn 


Ala 




410 










415 
Glu


Ser 


Phe 


Lys 








420 


Glu 


Met 


Thr 


Ser 






435 




Pro 


Pro 


Phe 


Gly 




450 






Lys 


Asp 


Asn 


Val 


465 








Trp 


Leu


Asn 


His 


Cys 


Trp 


Asp 


His 








500 


Glu 


Arg 


Phe 


Ser 






515 




Cys 


Ser 


Glu 


Glu 




530 







Gln Thr Asp Val

Arg Cys Asn Ala 

440 

Ser Lys Val Arg 
455 

Leu Arg Asp Arg 
470 

Gln Gly Ile Gln
485 

Asp Pro Glu Ala 

Glu Leu Glu His 

520 

Lys Ile Pro Glu
535 



Tyr Ser Met Ala 
425 

Val Gly Glu Val 

Glu His Pro Cys 

460 

Gly Arg Pro Glu 
475 

Met Val Cys Glu 
490 

Arg Leu Thr Ala 
505 

Leu Asp Arg Leu 

Asp Gly Ser Leu 

540 



Leu Val Leu Trp 
430 

Lys Asp Tyr Glu 
445 

Val Glu Ser Met 

Ile Pro Ser Phe

480 

Thr Leu Thr Glu 
495 

Gln Cys Val Ala
510 

Ser Gly Arg Ser 
525 

Asn Thr Thr Lys 



<210> 146 

<211> 358 

<212> PRT 

<213> Homo sapiens 



<400> 146 

Cys Glu Glu Pro 

1 

Lys Pro Tyr Tyr 

20 

Gly Tyr Phe Tyr 
35 

Asn His Thr Trp 
50 

Cys Pro Tyr Ile
65 

Gly Thr Tyr Glu 

Tyr Tyr Leu Ile

100

Val Ala Ile Trp
115 

Thr Pro Pro Pro 
130 

Glu Val Phe Glu 
145 

Pro Gly Pro Asp 

Gly Asp Asn Ser 

180 

Lys Cys Arg Phe 
195 

Gly Lys Lys Phe 
210 

Gly Phe Tyr Leu 
225 

Thr Trp Asp Pro 

Ser Thr Lys Pro 

260 

Thr Lys Ser Pro 
275 

Pro Pro Val Ser 
290 

Leu Asp Ser Leu 



Pro Thr Phe Glu 
5 

Glu Ile Gly Glu

Ile Pro Pro Leu

40 

Leu Pro Val Ser 
55 

Arg Asp Pro Leu 
70 

Phe Gly Tyr Gln
85

Gly Glu Glu Ile

Ser Gly Lys Pro

120

Lys Ile Lys Asn
135

Tyr Leu Asp Ala 
150 

Pro Phe Ser Leu 
165 

Val Trp Ser Arg 

Pro Val Val Glu 

200 

Tyr Tyr Lys Ala 
215 

Asp Gly Ser Asp 
230 

Pro Val Pro Lys 
245 

Pro Ala Leu Ser 

Ala Ser Ser Ala 

280 

Asn Tyr Pro Gly 
295 

Asp Val Trp Val 



Ala Met Glu Leu 
10 

Arg Val Asp Tyr 
25 

Ala Thr His Thr 

Asp Asp Ala Cys 

60 

Asn Gly Gln Ala
75

Met His Phe Ile
90

Leu Tyr Cys Glu
105

Pro Ile Cys Glu

Gly Lys His Thr 

140 

Val Thr Tyr Ser 
155 

Ile Gly Glu Ser
170 

Ala Ala Pro Glu 
185 

Asn Gly Lys Gln

Thr Val Met Phe 

220 

Thr Ile Val Cys
235 

Cys Leu Lys Val 
250 

His Ser Val Ser 

265 

Ser Gly Pro Arg 

Tyr Pro Lys Pro 

300 

Ile Ala Val Ile



Ile Gly Lys Pro
15

Lys Cys Lys Lys
30

Ile Cys Asp Arg
45 

Tyr Arg Glu Thr 

Val Pro Ala Asn 

80

Cys Asn Glu Gly 
95 

Leu Lys Gly Ser 
110 

Lys Val Leu Cys 
125 

Phe Ser Glu Val 

Cys Asp Pro Ala 

160 

Thr Ile Tyr Cys
175 

Cys Lys Val Val 
190 

Ile Ser Gly Phe
205 

Glu Cys Asp Lys 

Asp Ser Asn Ser 

240 

Leu Pro Pro Ser 
255 

Thr Ser Ser Thr 
270 

Pro Thr Tyr Lys 
285 

Glu Glu Gly Ile
Val Ile Ala Ile
305

Val Val Gly Val

Arg Arg Lys Lys

340 

Val Lys Phe Thr 
355 



310 

Ala Val Ile Cys
325 

Lys Gly Thr Tyr 
Ser Leu 




315 

Val Val Pro Tyr 
330 

Leu Thr Asp Glu 
345 



320 

Arg Tyr Leu Gln
335 

Thr His Arg Glu 
350 



<210> 147 

<211> 1148 

<212> PRT 

<213> Homo sapiens 



<400> 147 

Leu Pro Glu Ala 

1 

Tyr Ala Val Gln

20 

Gly Ser Pro Trp 
35 

Lys Cys Pro Val 
50 

Gln Thr Ser Thr
65 

Ser Leu Gly Leu 

Thr Cys Gly Pro 

100 

Thr Gly Val Cys 
115 

Phe Ser Pro Ala 
130 

Val Cys Asp Glu 
145 

Phe Leu Glu Lys 

Gln Val Gly Leu

180 

Leu Asn Thr Tyr 
195 

Thr Ser Gln Tyr
210 

Tyr Ala Arg Lys 
225 

Ala Thr Lys Val 

Ser Met Leu Lys 

260 

Arg Phe Gly Ile
275 

Thr Lys Asn Leu 
290 

Glu Arg Tyr Phe 
305 

Ala Gly Thr Leu 

Gly Gly Asp Asn

340 

Asp Tyr Ser Ser 
355 

Phe Gly Trp Ser 
370 

Ile Phe Pro Lys



Lys Ile Phe Ser
5

Gln Phe Ile Asn

Ser Gly Phe Pro 

40 

Asp Leu Ser Thr 
55 

Ser Ile Pro Asn
70

Ile Leu Thr Arg
85

Leu Trp Ala Gln

Ser Asp Ile Ser

120 

Thr Gln Pro Cys
135

Ser Asn Ser Ile
150

Phe Val Gln Gly
165

Ile Gln Tyr Ala

Lys Thr Lys Glu 

200 

Gly Gly Asp Leu 
215 

Tyr Ala Tyr Ser 
230 

Met Val Val Val 
245 

Ala Val Ile Asp

Ala Val Leu Gly 

280 

Ile Lys Glu Ile
295 

Phe Asn Val Ser 
310 

Gly Glu Gln Ile
325

Phe Gln Met Glu

Gln Asn Asp Ile

360

Gly Thr Ile Val
375

Gln Ala Phe Asp



Gly Pro Ser Ser 
10 

Pro Lys Gly Asn 

25 

Glu Asn Arg Met 

Ala Thr Cys Glu 

60 

Val Thr Glu Met 
75 

Asn Met Gly Thr 
90 

Gln Cys Gly Asn
105

Pro Asp Phe Gln

Pro Ser Leu Ile

140 

Tyr Pro Trp Asp 
155 

Leu Asp Ile Gly
170 

Asn Asn Pro Arg 
185 

Glu Met Ile Val

Thr Asn Thr Phe 

220 

Ala Ala Ser Gly 
235 

Thr Asp Gly Glu 
250 

Gln Cys Asn His
265 

Tyr Leu Asn Arg 

Lys Ala Ile Ala

300 

Asp Glu Ala Ala 
315 

Phe Ser Ile Glu
330

Met Ser Gln Val
345 

Leu Met Leu Gly 

Gln Lys Thr Ser

380

Gln Ile Leu Gln



Glu Gln Phe Gly
15 

Trp Leu Leu Val 
30 

Gly Asp Val Tyr 
45 

Lys Leu Asn Leu 

Lys Thr Asn Met 

80 

Gly Gly Phe Leu 
95 

Gln Tyr Tyr Thr
110 

Leu Ser Ala Ser 
125 

Asp Val Val Val 

Ala Val Lys Asn 

160 

Pro Thr Lys Thr 
175 

Val Val Phe Asn 
190 

Ala Thr Ser Gln
205

Gly Ala Ile Gln

Gly Arg Arg Ser 

240 

Ser His Asp Gly 
255 

Asp Asn Ile Leu
270 

Asn Ala Leu Asp 
285 

Ser Ile Pro Thr

Leu Leu Glu Lys 

320 

Gly Thr Val Gln
335 

Gly Phe Ser Ala 
350 

Ala Val Gly Ala 
365 

His Gly His Leu 
Asp Arg Asn His 

385 










390 










Ser 


Ser 


Tyr 


Leu 


Gly 


Tyr 


Ser 


Val 


Ala 


Ala 










405 










410 


Thr 


His 


Phe 


Val 


Ala 


Gly Ala 


Pro Arg 


Ala 








420 










425 




Val 


Leu 


Tyr 


Ser 


Val 


Asn 


Glu 


Asn Gly Asn 






435 










440 






His 


Arg Gly Asp Gln


He 


Gly 


Ser 


Tyr 


Phe 




450 










455 








Val 


Asp 


Val 


Asp 


Lys 


Asp 


Thr 


He 


Thr 


Asp 


465 










470 










Pro 


Met 


Tyr 


Met 


Ser 


Asp 


Leu 


Lys 


Lys 


Glu 










485 










490 


Phe 


Thr 


He 


Lys 


Lys 


Gly 


He 


Leu Gly Gln








500 










505 




Pro 


Glu 


Gly 


He 


Glu 


Asn 


Thr 


Arg 


Phe 


Gly 






515 










520 






Ser 


Asp 


He 


Asn 


Met 


Asp 


Gly 


Phe 


Asn 


Asp 




530 










535 








Leu 


Glu 


Asn 


Gin 


Asn 


Ser 


Gly 


Ala 


Val 


Tyr 


545 










550 










Gly 


Thr 


He 


Arg 


Thr 


Lys 


Tyr 


Ser 


Gin 


Lys 










565 










570 


Ala 


Phe 


Arg 


Ser 


His 


Leu 


Gin 


Tyr 


Phe 


Gly 








580 










585 




Gly Asp 


Leu 


Asn Gly 


Asp 


Ser 


lie 


Thr 


Asp 






595 










600 






Gly 


Gin 


Val 


Val 


Gin 


Leu 


Trp 


Ser 


Gin 


Ser 




610 










615 








Glu 


Ala 


Ser 


Phe 


Thr 


Pro 


Glu 


Lys 


He 


Thr 


625 










630 










Gin 


He 


He 


Leu 